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Novel method for identifying antibacterial compounds 

The present invention relates to a method for identifying an antagonist or inhibitor 
of the expression of a gene encoding a polypeptide essential for bacterial growth 
or survival as well as for an antagonist or. inhibitor of said polypeptide. The 
invention further relates to a method for improved antagonists or inhibitors. The 
invention also provides an antagonist or inhibitor of the activity of said 
polypeptide. The invention is further related to a method for producing a 
composition comprising said antagonist or inhibitor. Furthermore, the invention is 
related to the use of the polypeptide and the antagonist or inhibitor as well as to a 
method to identify a surrogate marker. 

Several documents are cited throughout the text of this specification. Each of the 
documents cited herein (including any manufacturer's specifications, instructions, 
etc.) are hereby incorporated by reference; however, there is no admission that 
any document cited is indeed prior art of the present invention. 

Since the beginning of the 1980s, a new trend has been observed in the 
industrialized countries. On the one hand, resistances to antibiotics have 
increased, which make it difficult or even impossible to treat many of the disease- 
causing agents. On the other hand, new infectious diseases, which had been 
unknown up to now, arise, and old diseases return. For example, diphteria and 
tuberculosis are old epidemics and increasingly surmounting in many different 
parts of the world. Especially tuberculosis (TB), a chronic infectious disease that is 
generally caused by infection with Mycobacterium tuberculosis, is a disease of 
major concern. Each year, 8 to 10 million new cases of TB are described, and. 
causing more than three million deaths per year. TB is a major disease in 
developing countries as well as an increasing problenn in developed areas of the 
world due to, for example, antibiotic resistance. 
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Additionally, M. bovis BCG vaccination has failed to protect against TB in several 
trials (WHO. Tech. Rep. Ser. (1980), 651. 1-15) for reasons that are not entirely 
dear (Fine. Tubercle 65 (1984), 137-153). It has been shown that the vaccine 
strain of M. bovis BCG only confers protection against the severe form of miliary 
tuberculosis in children (Fine, Lancet 346 (1995), 1339-1345). In contrast, its 
protective capacity against the most common form, pulmonary tuberculosis in 
adults, is low and highly variable (Colditz (1994), JAMA 271 , 698): 

The causes for this new trend are complex: mainly, the increasing number of 
antibiotic applications in medicine and agriculture often combined with an 
improper und uncontrolled use, helps to establish resistant organisms and 
generate the threat of bacterial infections resistant to all available therapies. 
Conventional techniques of developing antibiotics, i.e. synthesis of candidate 
substances and screening for antibacterial substances, even though speeded up 
by several orders of magnitude by the use of combinatorial approaches in recent 
years (e.g. US5324483. US5545568), are still too inefficient as they involve 
multiple screening steps of hundreds or thousands of more or less randomly 
chosen substances for efficiency in combating various infectious. agents. 
Therefore, it is a major concern to' fight the growing number of bacterial infections 
due to an increased frequency of multiple antibiotic resistances and to improve 
the available antibacterial therapies. 

Thus, the technical problem underlying the present invention was to provide a 
method and means for the development of an additional effective antibacterial 
therapy of infected humans and animals that can be used for the treatment of a 
broad spectrum of bacterial infections or diseases or disorders related to bacterial 
infections. The solution to this technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to a method for identifying an antagonist 
or inhibitor of the expression of a gene encoding a polypeptide essential for 
bacterial growth wherein said gene is selected from the group consisting of ygbB, 
yfhC, yacE, ychB, yejD, yrfl, yggJ* yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA, yagF. b1983, yidD. yceG and/or yjbC the sequence 
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of said genes being shown in Fig. 1, or a fragment or derivative or ortholog 
thereof, said method comprising the steps of 

(a) testing a candidate antagonist or inhibitor or a sample comprising a plurality 
of said candidate antagonists or inhibitors for the inhibition or reduction of 
transcription of said gene or a fragment or derivative thereof; or 

(b) testing a candidate antagonist or inhibitor or a sample comprising a plurality 
of said candidate antagonists or inhibitors for the inhibition or reduction of 
translation of mRNA transcribed from said gene or a fragment or derivative 
thereof; and 

(c) identifying an antagonist or inhibitor or a sample comprising a plurality of 
said candidate antagonists or inhibitors that tests positive in step (a) and/or 
(b). 

The term "antagonist" or "inhibitor" as used herein means naturally occurring and 
synthetic compounds capable of counteracting or inhibiting an activity of a gene or 
gene product or interactions of the gene or gene product with other genes or gene 
products. Determining whether a compound is capable of .inhibiting or 
counteracting specific gene expression can be done, for example, by Northern 
blot analysis. Western blot analysis or proteome analysis. It can further be done 
by monitoring the phenotypic characteristics of a bacterial cell contacted with the 
compounds and compare it to that of a wild-type cell. In an additional 
embodiment, said characteristics may be compared to that of a cell contacted with 
a. compound which is either known to be capable or incapable of suppressing or 
activating the protein or gene, respectively, according to the invention. For 
example, the bacterial cell can be a transgenic cell and the phenotypic 
characteristics comprises a readout system. Further examples of determining 
whether a compound is capable of inhibiting or counteracting specific gene 
expression are described below. 

The term "expression" means the production of a protein or nucleotide sequence 
in a cell. However, said term also includes expression of the protein in a cell-free 
system. It includes transcription into an RNA product, and/or translation into a 
polypeptide from a DMA encoding that product. 

The term "transcription" as used herein means , a DNA template dependent 
synthesis of a ribonucleic acid polymer encoding a polypeptide or a regulatory 
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sequence. The term "translation" as used herein means the polymerization of a 
polypeptide that is encoded by an RNA molecule by a protein complex. 
As used in accordance with the present invention, the term "fragment or 
derivative" denotes any variant the amino acid or nucleotide sequence of which 
deviates in its primary structure, e.g., in sequence composition or in length as well 
as to analogue components. For example, one or more amino acids of a 
polypeptide may be replaced in said fragment or derivative as long as the 
modified polypeptides remain functionally equivalent to their described 
counterparts. The term "fragment or derivative" further .denotes compounds 
analog to an antagonist or inhibitor that should have a stabilized electronic 
configuration and molecular conformation that allows key functional groups to be 
presented to the mentioned polypeptide in substantially the same way as the 
antagonist and inhibitor. The variant of the polypeptide may be a naturally 
occurring allelic variant of the polypeptide or non-naturally occurring variants of 
those polynucleotides. 

The term "orthologs" as used herein means homologous sequences in different 
species that evolved from a common ancestoral gene by speciation. Normally, 
orthologs retain the same function in the course of evolution. However, 
orthologous genes may or may not be responsible for a similar function (see, e.g., 
the glossary of the "Trends Guide to Bioinformatics", Trends Supplement 1998, 
Elsevier Science). Orthologous genes, nucleic acids or proteins comprise genes, 
nucleic acids or proteins which have one or more sequences or structural motifs in 
common. For example, the sequence motifs of proteins can comprise short, i.e. 
repetitive sequences or amino acid positions conserved in the primary structure 
and/or conserved in higher protein structures, e.g. secondary or tertiary structure. 
Orthologous nucleic acids or genes can comprise molecules having short 
stretches of one or more homologous (same or similar) sequences, for example 
protein binding boxes or structure forming boxes.. Methods for the identification of 
a candidate ortholog of a gene or polypeptide described herein are known . to 
those skilled in the art and are described for example in Sambrook et al. (1989), 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
New York, or Ausubel (1994), Current Protocols in Mol. Biol., The person skilled in 
the art knows how to identify orthologous genes, nucleic acids or polypeptides by 
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computer supported analysis (e.g. BLAST) of known sequences and its 
interpretation. 

The terms "gene", "polynucleotide", "nucleic acid sequence", "nucleotide 
sequence", "DNA sequence" or "nucleic acid molecule" as used herein refer to 
polymeric forms of nucleotides of any length, either ribonucleotides or 
deoxyribonucleotides and only to the primary structure of the molecule. Thus, 
these terms include double- and single-stranded DNA, and RNA. They also 
include known types of modifications, for example, methylation, "caps" 
substitution of one or more of the naturally occurring nucleotides with an analog. 
Preferably, the DNA sequence of the invention comprises a coding sequence 
encoding at least the mature form of the above defined protein, i.e. the protein 
which is posttranslationally processed in its biologically active form, for example 
due to cleavage of leader or secretory sequences or a proprotein sequence or 
other natural proteolytic cleavage points. 

The term "plurality of candidate antagonists or inhibitors" is to be understood as a 
plurality of substances which may or may. not be identical. 

Said antagonists or inhibitors or plurality of candidate antagonists or inhibitors 
may be chemically synthesized or microbiologically produced and/or comprised in, 
for example, samples, e.g., cell extracts from, e.g., plants, animals or 
microorganisms. Furthermore, said compound(s) may be known in the art but 
hitherto not known to be capable of suppressing or inhibiting said polypeptide. 
The reaction mixture may be. a cell free extract or may comprise a cell or tissue 
culture. Suitable set ups for the method of the invention are known to the person 
skilled in the art and are, for example, generally described in Alberts et al., 
Molecular Biology of the Cell, third edition (1994), in particular Chapter 17. The 
plurality of compounds may be, e.g., added to the reaction mixture, culture 
medium, injected into the cell or sprayed onto the plant. 

By combining computational processing of genomic information with microbial 
genetics, the inventors have been able to identify 24 E. coli essential genes and 
their respective orthologs (Fig. 3) that fulfill several criteria for being attractive 
antibacterial targets: hypothetical open reading frames, coding for essential 
functions (mutation is lethal for growth in rich media), broad consen/ation 
(orthologs are present in a wide range of bacteria including H, influenza, S, 
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pneumoniae, H. pylori, and B, burgdorferi) (Fig. 3) and low toxicity potential in 
higher organisms (mostly no orthologs are identified in the simple eukaryote S. 
cerevisiae). Thus, an antagonist or inhibitor of the expression of such an essential 
gene or of its function provides the key for an antibacterial therapy. The inventors 
assume that said antagonist or inhibitor stops or reduces bacterial growth and/or 
mediates bacterial death. 

Thus, the method of the present invention provides the options of development of 
new broad spectnjm antibiotics against new pharmaceutical important targets. 
The findings of the present invention are particularly important in view of the 
drawbacks of the present forms of treatment of bacterial infections, diseases and 
disorders related to bacterial infections. 

in line with the above, the present invention also relates to a method for testing a 
candidate antagonist or inhibitor of a polypeptide or mRNA essential for bacterial 
growth or survival encoded by a gene selected from the group consisting of ygbB, 
yfhC, yacE, ychB, yejD, yrfl. yggj, yjeE, yiaO, yrdC. yhbC, ygbP, ybeY, gcpE, 
kdtB. pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD. yceG and/or yjbC or a fragment, 
derivative or ortholog thereof comprising the steps of 

(a) contacting a bacterial cell with candidate antagonist or inhibitor or a sample 
comprising a plurality of said candidate antagonists or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition and/or cell 
death. 

in a further embodiment, the present invention relates to a method for testing a 
candidate antagonist or inhibitor of the function of a gene essential for bacterial 
growth or survival wherein said gene is selected from the group consisting of 
ygbB. yfhC, yacE, ychB, yejb, yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, 
gcpE, kdtB, pfs, ycaJ. b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC .or a 
fragment, derivative or ortholog thereof, comprising the steps of 
(a) contacting a bacterial cell comprising said gene with a candidate antagonist 

or inhibitor or a sample comprising a plurality of said candidate antagonists 

or inhibitors; and 
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(b) testing whether said contacting leads to cell growth inhibition and/or cell 
death. 

Bacteria, for which was shown that a gene as mentioned above expressed is 
essential, can be used in a proliferation assay to identify both ligands and 
potential antagonists or inhibitors to said polypeptide encoded by said essential 
gene. For example, E. coil are grown in culture medium and incorporation of DNA 
precursors such as ^H-thymidine or 5-bromo-2'-deoxyuridine (BrdU) is monitored 
as a parameter for DNA synthesis and cellular proliferation. Cells which have 
incorporated BrdU into DNA can be detected using a monoclonal antibody against 
BrdU and measured by an enzyme or fluorochrome-conjugated second antibody. 
The reaction is quantitated by fluorimetry or by spectrophotometry. The ability of 
the compound to be screened to inhibit proliferation may then be quantified. 
Further methods to determine growth and proliferation of bacteria are well known 
in the art, for example in Drews, MikrobioL Praktikum, Berlin, 1976. 
Preferably, the antagonist or inhibitor binds to the gene product, i.e. the RNA or 
polypeptide, specifically encoded by said gene. 

For example, a candidate antagonist or inhibitor not known to be capable of 
binding to an polypeptide encoded by a essential gene as described above can be 
tested to bind thereto comprising contacting a bacterial cell comprising an isolated 
molecule encoding said polypeptide with a candidate antagonist or inhibitor under 
conditions permitting binding of ligands known to bind thereto, detecting the 
presence of any bound ligand, and thereby determining whether such candidate 
antagonist or inhibitor inhibits the binding of a ligand to a polypeptide as described 
above. 

Proteins that bind to a polypeptide as described above and might inhibit or 
counteract to said polypeptide can be "captured" using the yeast two-hybrid 
system (Fields, Nature 340 (1989), 245-246). A modified version of the yeast two- 
hybrid system has been described by Roger Brent and his colleagues (Gyuris, 
Cell 75 (1993), 791-803; Zen/os, Cell 72 (1993), 223-232). Briefly, a domain of the 
polypeptide is used as bait for binding compounds. Positives are then selected by 
their ability to grow on plates lacking leucine, and then further tested for their 
ability to turn blue on plates with X-gal-, as previously described in great detail 
(Gyuris, supra; WO 95/31544). Once amino acid sequences are identified which 
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bind to a polypeptide essential for bacterial growth or survival, these sequences 
can be screened for antagonist activity using, for example, the proliferation assay 
described above or used for screening for antagonists of said binding. 
Another assay which can be performed to identify inhibitors and antagonists 
involves the use of combinatorial chemistry to produce random peptides which 
then can be screened for both binding affinity and antagonist effects. One such 
assay has recently been performed using random peptides expressed on the 
surface of a bacteriophage (Wu (1996), Nature Biotechnology 14, 429-431). 

In a preferred embodiment of the method of the present invention said method 
further comprises identifying an antagonist or inhibitor optionally from said sample 
of candidate antagonists or inhibitors. 

If a sample contains a candidate antagonist or inhibitor, or a plurality of candidate 
antagonists or inhibitors, as identified in the method of the invention, then it is 
either possible to isolate the candidate antagonists or inhibitors from the original 
sample identified as containing the compound capable of suppressing or inhibiting 
bacterial growth or survival, or one can further subdivide the original sample, for 
example, if it consists of a plurality of different candidate antagonists or inhibitors, 
SO' as to reduce the number of different substances per sample and repeat the 
method with the subdivisions of the original sample. Depending on the complexity 
of the salmples, the steps described above can be performed several times, 
preferably until the sample identified according to the method of the invention only 
comprises a limited number of or only one substance{s). Preferably said sample 
comprises substances of similar chemical and/or physical properties, and most 
preferably said substances are identical. As regards the identification of candidate 
antagonists or inhibitors by any of the above-referenced embodiments of the, 
invention, a variety of formats or tools is available to the person skilled in the art. 
Thus, several methods are known to the person skilled in the art for producing 
and screening large libraries to identify compounds having specific affinity for a 
target. These methods include the phage-display method in which randomized 
peptides. are displayed from phage and screened by affinity chromatography to an 
immobilized receptor; see, e.g.. WO 91/17271, WO 92/01047, US-A-5,223.409. In 
another approach, combinatorial libraries of polymers immobilized on a chip are 
synthesized using photolithography; see, e.g., US-A-5, 143,854, WO 90/15070 
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and WO 92/10092. The immobilized polymers are contacted with a labeled 
receptor and scanned for label to identify polymers binding to the receptor. The 
synthesis and screening of peptide libraries on continuous cellulose membrane 
supports that can be used for identifying binding ligands of the polypeptide of the 
invention and thus possible inhibitors and antagonists is described, for example, 
in Kramer. Methods Mol. Biol. 87 (1998), 25-39. This method can also be used, 
for example, for determining the binding sites and the recognition motifs in the 
polypeptide as described above. In like manner, the substrate specificity of the 
DnaK chaperon was determined and the contact sites between human interleukin- 
6 and its receptor; see Rudiger, EMBO J. 16 (1997). 1501-1507 and Weiergraber. 
FEBS Lett. 379 (1996), 122-126, respectively. Furthermore, the above-mentioned 
methods can be used for the construction of binding supertopes derived from the 
polypeptide of the invention. A similar approach was successfully described for 
peptide antigens of the anti-p24 (HIV-1) monoclonal antibody; see Kramer, Cell 91 
(1997). 799-809. A general route to fingerprint analyses of peptide-antibody 
interactions using the clustered amino acid peptide library was described in 
Kramer. Mol. Immunol. 32 (1995), 459-465. In addition, antagonists or inhibitors of 
a polypeptide described above can be derived and identified from monoclonal 
antibodies that specifically react with said polypeptide in accordance with the 
methods as described in Doring, Mol. Immunol. 31 (1994), 1059-1067. 
More recently, WO 98/25146 described further methods for screening libraries of 
complexes for compounds having a desired property, especially, the capacity to 
agonize, bind to, or antagonize a polypeptide or its cellular receptor. The 
complexes in such libraries comprise a compound under test, a tag recording at 
least one step in synthesis of the compound, and a tether susceptible to 
modification by a reporter molecule. Modification of the tether is used to signify 
that a complex contains a compound having a desired property. The tag can be 
decoded to reveal at least one step in the synthesis of such a compound. Other 
methods for identifying compounds which interact with the proteins according to the 
invention or nucleic acid molecules encoding such molecules are, for example, the 
in vitro screening with the phage display system as well as filter binding assays or 
"real time" measuring of interaction using, for example, the BIAcore apparatus 
(Pharmacia). 
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All these methods can be used in accordance with the present invention to identify 
antagonists and inhibitors of the polypeptide of the invention. 

Additionally, the present invention relates in a preferred embodiment to a method 
comprising improving inhibitors or antagonists identified by peptidomimetics or by 
applying phage display or combinatorial library technique step(s). 
Peptidomimentics, phage display and combinatorial library techniques are well- 
known in the art and can be applied by the person skilled in the art without further 
ado to the improvement of the antagonist or inhibitor that is identified by the basic 
method referred to herein above. 

Methods for the generation and use of peptidomimetic combinatorial libraries are 
described in the prior art. for example in Ostresh, Methods In Enzymology 267 
(1996), 220-236; Dosner, Bioorg. Med. Chem. 4 (1996), 709-715; Beeley, Trends 
Biotechn. 12 (1994), 213-216; al-Obeidi. Mol. Biotechn. 9 (1998), 205-223; Wiley, 
Med. Res. Rev. 13 (1993), 327-384; Bohm, J, Comput. Aided Mol. Des. TO 
(1996), 265-272; and Hruby. Biopolymers 43 (1997), 219-266. 
Various sources for the basic structure of such an antagonist or inhibitor can be 
employed and comprise, for example, mimetic analogs of the polypeptide of the 
invention. Mimetic analogs of the polypeptide of the invention or biologically active 
fragments thereof can be generated by, for example, substituting the amino acids 
that are expected to be essential for the biological activity with, e.g., 
stereoisomers, i.e. D-amino acids; see e.g., Tsukida, J. Med. Chem. 40 (1997), 
3534-3541. Furthermore, in case fragments are used for the design of biologically 
active analogs pro-mimetic components can be incorporated into a peptide to 
reestablish at least some of the conformational properties that may have been lost 
upon removal of part of the original polypeptide; see, e.g., Nachman, Regul. Pept. 
57 (1995), 359-370. Furthermore, the polypeptide can be used to identify 
synthetic chemical peptide mimetics that bind to or can function as a ligand, 
substrate, binding partner or the receptor of the polypeptide as effectively as does 
the natural polypeptide; see, e.g., Engleman, J. Clin. Invest. 99 (1997), 2284- 
2292. 

The structure-based design and synthesis of low-molecular-weight synthetic 
molecules that mimic the activity of the native biological polypeptide is further 
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described in, e.g., Dowd, Nature Biotechnol. 16 (1998), 190-195; Kieber-Emmons. 
Current Opinion Biotechnol. 8 (1997), 435-441; Moore, Proc. West Pharmacol. 
Soc. 40 (1997), 115-119; Mathews, Proc. West Pharmacol. Soc. 40 (1997), 121- 
125; Mukhija, European J. Biochem. 254 (1998). 433-438. 

It is also well known to the person skilled in the art, that it is possible to design, 
synthesize and evaluate mimetics of small organic compounds that, for example, 
can act as a substrate or ligand to a polypeptide as encoded by the essential 
gene as identified above. For example, it has been described that D-glucose 
mimetics of hapalosin exhibited similar efficiency as hapalosin in antagonizing 
multidrug resistance assistance-associated protein in cytotoxicity; see Dinh, J. 
Med. Chem. 41 (1998), 981-987. 

The essential gene described above or the RNA encoded thereof, as has been 
described above, can also serve as a target for antagonists or inhibitors. 
Antagonists may comprise, for example, proteins that bind to the mRNA of said 
gene, thereby destabilizing the -native conformation of the mRNA and disturbing 
transcription and/or translation. Furthermore, methods are described in the 
literature for identifying nucleic acid molecules such as an RNA fragment that 
mimics the structure of a defined or undefined target RNA molecule to which a 
compound binds inside of a cell resulting in retardation of cell growth or cell death; 
see, e.g., WO 98/18947 and references cited therein. These nucleic acid 
molecules can be used for identifying unknown compounds of pharmaceutical 
and/or agricultural interest, and for identifying unknown RNA targets, for use in 
treating a disease. These methods and compositions can be used in screening for 
novel antibiotics, bacteriostatics, or modifications thereof or for identifying 
compounds useful to alter expression levels of proteins encoded by a nucleic acid 
molecule. Alternatively, for example, the conformational stnjcture of the RNA 
fragment which mimics the binding site can be employed in rational drug design to 
modify known antibiotics to make them bind more avidly to the target. One such 
methodology is nuclear magnetic resonance (NMR), which is useful to identify 
drug and RNA conformational structures. Still other methods are, for example, the 
drug design methods as described in WO 95/35367, US-A-5,322,933, where the 
crystal structure of the RNA fragment can be deduced and computer programs 
are utilized to design novel binding compounds which can act as antibiotics. 
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The candidate antagonists and Inhibitors which can be tested and identified 
according to a method of the invention may be taken from expression libraries, 
e.g., cDNA expression libraries, peptides, proteins, nucleic acids, antibodies, 
small organic compounds, hormones, peptidomimetics, PNAs or the like (Milner, 
Nature Medicine 1 (1995), 879-880; Hupp, Cell 83 (1995), 237-245; Gibbs, Cell 79 
(1994), 193-198 and references cited supra). Furthermore, genes encoding a 
putative regulator of an essential bacterial protein and/or which exert their effects 
up- or downstream said protein may be identified using, for example, insertion 
mutagenesis using, for example, gene targeting vectors known in the art (see, 
e.g., Hayashi, Science 258 (1992), 1350-1353; Fritze and Walden, Gene 
activation by T-DNA tagging. In Methods in Molecular biology 44 (Gartland, 
K.M.A. and Davey, M.R., eds). Totowa: Human Press (1995), 281-294) or 
transposon tagging (Chandlee, Physiologia Plantarum 78 (1990), 105-115). Said 
compounds can also be functional derivatives or analogues of known inhibitors or 
antagonists. Such useful compounds can be for example transacting factors 
which bind an above-described polypeptide. Identification of transacting factors 
can be carried out using standard methods in the art (see, e.g., Sambrook. supra, 
and Ausubel, supra). To determine whether a protein binds to the protein or 
regulatory sequence of the invention, standard native gel-shift analyses can be 
carried out. In order to identify a transacting factor which binds to the protein or 
regulatory sequence of the invention, the protein or regulatory sequence of the 
invention can be used as an affinity reagent in standard protein purification 
methods, or as a probe for screening an expression library. The identification of 
nucleic acid molecules which encode proteins which interact with the polypeptide 
described above can also be achieved, for example, as described in Scofield 
(Science 274 (1996), 2063-2065) by use of the so-called yeast "two-hybrid system"; 
see also the appended example. In this system, e.g., the protein encoded by the 
nucleic acid molecules identified in this invention or a smaller part thereof is linked to 
the DNA-binding domain of the GAL4 transcription factor. A yeast strain expressing 
this fusion gene and comprising a lacZ reporter gene driven by an appropriate 
promoter, which is recognized by the GAL4 or LexA transcription factor, is 
transformed with a library of cDNAs which will express plant genes or fragments 
thereof fused to an activation domain. Thus, if a peptide encoded by one of the 
cDNAs is able to interact with the fusion peptide comprising a peptide of a protein of 



wo 00/61793 PCT/EPOO/03135 

13 

the invention, the complex is able to direct expression of the reporter gene. In this 
way the nucleic acid molecules and the encoded peptide can be used to identify 
peptides and proteins interacting with the polypeptide described above. It is 
apparent to the person skilled in the art that this and similar systems may then 
further be exploited for the identification of inhibitors or antagonists of the 
polypeptide. 

Once the transacting factor is identified, modulation of its binding to or regulation 
of expression of the polypeptide described above can be pursued, beginning with, 
for example, screening for inhibitors against the binding of the transacting factor to 
the protein specified in accordance with the present invention. Inhibition of 
bacterial growth could then be achieved by applying the transacting factor (or its 
inhibitor). In addition, if the active form of the transacting factor is a dimer, 
dominant-negative mutants of the transacting factor could be made in order to 
inhibit its activity. 

Thus, the present invention also relates to the use of the polypeptide as defined 
above for the identification of antagonists or inhibitors of a polypeptide essential 
for bacterial growth or survival. 

In another embodiment, the present invention relates to a method for designing an 
improved antagonist or inhibitor for the treatment of a bacterial infection or 
disorder or disease related to a bacterial infection comprising the steps of 

(a) identification of the binding site of an antagonist or inhibitor to the 
polypeptide ygbB. yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE. yiaO, yrdC, 
yhbC, ygbP. ybeY, gcpE, kdtB, pfs, ycaJ, b1808. yeaA, yagF, b1983, yidD. 
yceG and/or yjbC, the sequence of said genes being shown in Fig. 1, or 
identified according to the method of the present invention, by site-directed 
mutagenesis and chimeric polypeptide studies; 

(b) molecular modeling of both the binding site of said antagonist or inhibitor 
and the structure of said polypeptide; and 

(c) modification of said antagonist or inhibitor to improve its binding specificity 
or affinity for the polypeptide. 



Biological assays as described above or other assays such as assays based on 
crystallography or NMR may be employed to assess the specificity or potency of 
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the antagonist or inhibitor wherein the decrease of one or more activities of the 
polypeptide may be used to monitor said specificity or potency. All techniques 
employed in the various steps of the method of the invention are conventional or 
can be derived by the person skilled in the art from conventional techniques 
without further ado. 

For example, identification of the binding site of said antagonist or inhibitor by site- 
directed mutagenesis and chimerical protein studies can be achieved by 
modifications in the (poly)peptide primary sequence that affect the antagonist's or 
inhibitor's affinity; this usually allows to precisely map the binding pocket for the 
drug, identification of binding sites may be assisted by computer programs. Thus, 
appropriate computer programs can be used for the identification of interactive 
sites of a putative antagonist or inhibitor and the polypeptide of the invention by 
computer assisted searches for complementary structural motifs (Fassina, 
Immunomethods 5 (1994). 114-120). 

As regards step (b), the following protocols may be envisaged: Once the effector 
site for antagonists or inhibitors has been mapped, the precise residues 
interacting with different parts of the antagonists or inhibitors can be identified by 
combination of the information obtained from mutagenesis studies (step (a)) and 
computer simulations of the structure of the binding site provided that the precise 
three-dimensional structure of the antagonists or inhibitors is known (if not, it can 
be predicted by computational simulation). If said antagonist or inhibitor is itself a 
peptide, it can be also mutated to determine which residues interact with others in 
the above-mentioned polypeptide essential for bacterial growth and survival. 
Finally, in step (c) the antagonist or inhibitor can be modified to improve its 
binding affinity or its potency and specificity. If, for instance, there are electrostatic 
interactions between a particular residue of an polypeptide as defined above and 
some region of an antagonist or inhibitor molecule, the overall charge in that 
region can be modified to increase that particular interaction. Furthermore, the 
three-dimensional and/or crystallographic stmcture of inhibitors or antagonists of 
the polypeptide of the invention can be used for the design of peptidomimetic 
inhibitors or antagonists, e.g. in combination with said polypeptide (Rose, 
Biochemistry 35 (1996), 12933-12944; Rutenber, Bioorg. Med. Chem. 4 (1996), 
1545-1558). 
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Potential antagonists/inhibitors include antisense molecules. Antisense 
technology can be used to control gene expression through antisense DNA or 
through triple-helix formation. Antisense techniques are discussed, for example, in 
Okano, J. Neurochem. 56 (1991), 560; Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Triple helix 
formation is discussed in, for instance, Lee, Nucl. Acids Res. 6 (1979). 3073; 
Cooney, Science 241 (1988), 456; and Dervan, Science 251 (1991), 1360. The 
methods are based on binding of a polynucleotide to a complementary DNA or 
RNA. 

For example, the 5' coding portion of a polynucleotide that encodes the mature 
polypeptide as described above may be used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide 
is designed to be complementary to a region of the gene involved in transcription 
thereby preventing transcription and the production of the protein. The antisense 
RNA oligonucleotide hybridizes to the mRNA and blocks translation of the mRNA 
molecule into receptor polypeptide. As indicated, antagonist or inhibitor e.g. 
polyclonal and monoclonal antibody according to the teachings of the present 
invention can be raised according to the methods disclosed in Tartaglia, J. Biol. 
Chem. 267 (1992), 4304-4307; Tartaglia, Cell 73 (1993), 213-216, and PCT 
Application WO 94/09137. 

Antibodies may be prepared by any of a variety of methods using immunogens of 
the polypeptide described above. As indicated, such immunogens include the full 
length polypeptide (which may or may not include the leader sequence) and 
fragments such as the ligand binding domain, the extracellular domain and the 
intracellular domain. These antibodies can be monoclonal antibodies, polyclonal 
antibodies or synthetic antibodies as well as fragments of antibodies, such as 
Fab*, Fv. F(ab')2, disulphide-bridged Fv or scFv fragments, etc. Monoclonal 
antibodies can be prepared, for example, by the techniques as originally 
described in Kohler and Milstein, Nature 256 (1975), 495, and Galfre. Meth. 
Enzymol. 73 (1981), 3, which comprise the fusion of mouse myeloma cells to 
spieen cells derived from immunized mammals. Furthermore, antibodies or 
fragments thereof to the aforementioned peptides can be obtained by using 
methods which are described, e.g.. in Harlow and Lane "Antibodies, A Laboratory 
Manual", CSH Press, Cold Spring Harbor. 1988. 
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The antagonists or inhibitors isolated by the above methods also serve as lead 
compounds for the development of analog compounds. The analogs should have 
a stabilized electronic configuration and molecular conformation that allows key 
functional groups to be presented to the receptor in substantially the same way as 
the lead compound. In particular, the analog compounds have spatial electronic 
properties which are comparable to the binding region, but can be smaller 
molecules than the lead compound, frequently having a molecular weight below 
about 2 kD and preferably below about 1 kD. Identification of analog compounds 
can be performed through use of techniques such as self-consistent field (SCF) 
analysis, configuration interaction (CI) analysis, and normal mode dynamics 
analysis. Computer programs for implementing these techniques are available; 
e.g., Rein, Computer-Assisted Modeling of Receptor-Ligand Interactions (Alan 
Liss, New York, 1989). Methods for the preparation of chemical derivatives and 
analogues are well known to those skilled in the art and are described in, for 
example, Beilstein, Handbook of Organic Chemistry, Springer edition New York 
Inc., 175 Fifth Avenue, New York, N.Y, 10010 U.S.A. and Organic Synthesis, 
Wiley. New York, USA. Furthermore, said derivatives and analogues can be 
tested for their effects according to methods known in the art. Furthermore, 
peptidomimetics and/or computer aided design of appropriate derivatives and 
analogues can be used, for example, according to the methods described above. 

The inhibitor or antagonist identified by the above-described method may prove 
useful as a pesticide, and/or antibiotic. The inhibitors and antagonists of the 
present invention preferably have a specificity at least substantially identical to the 
binding specificity of the natural ligand or binding partner of the polypeptide 
described above. An antagonist or inhibitor can have a binding affinity to said 
polypeptide of at least IG^M*"'; preferably higher than lO^M""^ and advantageously 
up to 10^°M'V In a preferred embodiment, an inhibitor, e.g. suppressive antibody, 
has an affinity of at least about 10**^ M, preferably at least about 10'^ M and most 
preferably, at least about 10*''^ M; and the antagonist has an affinity of less than 
about 10'*^ M. preferably less than about 10'^ M and most preferably in order of 10' 
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in the case of nucleic acid molecules it is preferred that they have a binding 
affinity to those encoding the annino acid sequences encoded in any one of SEQ 
ID NOS: 16 to 39 of at nnost 2-. . 5- or 10-fold less than an exact complement of 20 
consecutive nucleotides of the above described nucleic acid molecules. 

In another embodiment, the present invention relateis to a method for producing a 
therapeutic agent comprising synthesizing the above-described antagonist or 
inhibitor. 

Preferably, the compound identified according to the above described method or 
its analog or derivative is further formulated in a therapeutically active form or in a 
form suitable for the application against bacterial infections or diseases related to 
such an infection. For example, it can be combined with a pharmaceutically 
acceptable carrier known in the art. Thus, the present invention also relates to a 
method of producing a (therapeutically effective) composition comprising the 
steps of one of the above described methods of the invention and combining the 
compound obtained or identified in the method of the invention or an analog or 
derivative thereof with a pharmaceutically acceptable carrier. 

Also, the present invention relates to a composition comprising the antagonist or 
inhibitor mentioned above. As is evident from the above, the present invention 
generally relates to compositions comprising at least one of the aforementioned 
antagonists or inhibitors, which may be nucleic acid molecules, proteins or 
antibodies. Advantageously, said composition is for use as a medicament, a 
diagnostic means, or a kit. 

The term "composition", as used in accordance with the present invention, 
comprises at least one small molecule or molecule as identified herein above, 
such as a protein, an antigenic fragment of said protein, a fusion protein, a nucleic 
acid molecule and/or an antibody as described above and, optionally, further 
molecules, either alone or in combination, like e.g. molecules which are capable 
of optimizing antigen processing, cytokines, immunoglobulins, lymphokines or 
CpG-containing DNA stretches or, optionally, adjuvants. The composition may be 
in solid, liquid or gaseous form and may be, inter alia, in form of (a) powder(s). (a) 
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tablet(s), (a) solution(s) or (an) aerosol(s). In a preferred embodiment, said 
composition comprises at least two. preferably three, more preferably four, most 
preferably five differentially synthesized proteins. 

The antagonists and inhibitors of the invention appear to function against gene 
products which are essential in several strains or genera of bacteria. Accordingly, 
the above-described antagonists and inhibitors may be used to inhibit the growth 
of a wide spectrum of bacteria. The above described antagonists or inhibitors may 
be used to slow, stop, or reverse bacterial growth. Thus, the present invention 
also relates to a method of producing a therapeutic agent comprising the steps of 
the methods described hereinbefore and synthesizing the antagonist or inhibitor 
obtained or identified as described above or an analog or derivative thereof, 
preferably in an amount sufficient to provide said agent in a therapeutically 
effective amount to a patient. 

Compounds identified by the above methods or analogs are formulated for 
therapeutic use as pharmaceutical compositions. The compositions can also 
include, depending on the formulation desired, pharmaceutically acceptable, 
usually sterile, non-toxic carriers or diluents, which are defined as vehicles 
commonly, used to formulate pharmaceutical compositions for animal or human 
administration. The diluent is selected so as not to affect the biological activity of 
the combination. Examples of such diluents are distilled water, physiological 
saline. Ringer's solutions, dextrose solution, and Hank's solution. In addition, the 
pharmaceutical composition or formulation may also include other carriers, 
adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like. 
A therapeutically effective dose refers to that amount of protein or its antibodies, 
antagonists, or inhibitors which ameliorate the symptoms or condition. 
Therapeutic efficacy and toxicity of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
ED50 (the dose therapeutically effective in 50% of the population) and LD50 (the 
dose lethal to 50% of the population). The dose ratio between therapeutic and 
toxic effects is the therapeutic index, and it can be expressed as the ratio, 
LD50/ED50. 

Compositions comprising such carriers can be formulated by well known 
conventional methods. These pharmaceutical compositions can be administered 
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to the subject at a suitable dose. Administration of the suitable compositions may 
be effected by different ways, e.g., by intravenous, intraperitoneal, subcutaneous, 
intramuscular, topical, intradermal, intranasal or intrabronchial administration. The 
dosage /egimen will be determined by the attending physician and clinical factors. 
As is well known in the medical arts, dosages for any one patient depends upon 
many factors, including the patient's size, body surface area, age, the particular 
compound to be administered, sex, time and route of administration, general 
health, and other drugs being administered concurrently. Proteinaceous 
pharmaceutically active matter may be present in amounts between 1 ng and 10 
mg per dose; however, doses below or above this, exemplary range are 
envisioned, especially considering the aforementioned factors. Administration of 
the suitable compositions may be effected by different ways, e.g., by intravenous, 
intraperitoneal, subcutaneous, intramuscular, topical or intradermal administration. 
If the regimen is a continuous infusion, it should also be in the range of 1 pg to 10 
mg units per kilogram of body weight per minute, respectively. Progress can be 
monitored by periodic assessment. The compositions of the invention may be 
administered locally or systemically. Administration will generally be parenterally, 
e.g., intravenously. The compositions of the invention may also be administered 
directly to the target site, e.g., by biolistic delivery to an internal or external target 
site or by catheter to a site in an artery. Preparations for parenteral administration 
include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. Parenteral vehicles include 
sodium ^chloride solution. Ringer's dextrose, dextrose and sodium chloride, 
lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient 
replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), 
and the like. Preservatives and other additives may also be present such as, for 
example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the 
like. Furthermore, the pharmaceutical composition of the invention may comprise 
further agents such as interleukins, interferons and/or CpG-containing DNA 
stretches, depending on the intended use of the pharmaceutical composition. 
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In another embodiment, the present invention relates to a kit comprising at least 
one of the aforementioned antagonists or inhibitors of the invention. The kit of the 
invention as well as the composition may in a preferred embodiment contain 
further ingredients such as selection markers, antibiotics, cytokines and 
components for simplifying or supporting the treatment of bacterial infections or 
disorders or diseases related to bacterial infections. The kit of the invention may 
advantageously be used for carrying out the method of the invention and could 
be, inter alia, employed in a variety of applications referred to herein, e.g., in the 
diagnostic field or as research tool. The parts of the kit of the invention can be 
packaged individually in vials or in combination in containers or multicontainer 
units. Manufacture of the kit follows preferably standard procedures which are 
known to the person skilled in the art. The kit or its ingredients according to the 
invention can be used in antibacterial therapies, for example, for any of the above 
described methods for detecting further inhibitors and antagonists essential for 
bacterial growth and survival. The kit of the invention and its ingredients are 
expected to be very useful for the healing and protection of animals and humans 
suffering from a bacterial infection. 

The present invention also relates to a method for treating or preventing bacterial 
infections or diseases or disorders related to bacterial infections comprising the 
step of administering to a subject in need thereof an antagonist or inhibitor 
identified herein above, optionally comprised in a pharmaceutical composition of 
the invention. 

In another embodiment the present invention relates to the use of a polypeptide 
encoded by the gene as identified above or a fragment, derivative or ortholog 
thereof or of any of said genes for the identification of an antagonist or inhibitor of 
said polypeptide fragment, derivate or ortholog or said gene. 

In a further embodiment the present invention relates to the use of said 
polypeptide, the therapeutic agent produced according to the invention, the 
antagonist or inhibitor obtained or identified by the method or use according to the 
invention for the preparation of .a pharmaceutical composition for the treatment of 
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(a) bacterial infection(s), disorcler(s) and/or disease(s) related to bacterial 
infections. 

In another embodiment the present invention relates to a method for treating or 
preventing bacterial infections or diseases or disorders related to bacterial 
infections comprising the step of administering to a subject in need thereof an 
antagonist or inhibitor identified herein above, optionally comprised in the 
pharmaceutical composition according to the present invention. 

In a further embodiment the present invention relates to the use of the above- 
described polypeptide, a fragment, derivative or ortholog thereof or of any of said 
genes for screening for polypeptides interacting with said polypeptide using 
protein-protein interaction technologies, and/or for validating such interaction as 
being essential for bacterial survival and/or for screening for antagonists or 
inhibitors of such interaction. 

In a further embodiment the present invention relates to the use of the above- 
described polypeptide, a fragment, derivative or ortholog thereof or of any of said 
genes for screening of polypeptide for polypeptide binding to said polypeptide, 
and/or for validating the peptides binding to said polypeptide as preventing growth 
of bacteria or being lethal to bacteria upon expression of said polypeptides in said 
bacteria, and/or for screening for small molecules competitively displacing said 
peptides. 

In another embodiment the present invention relates to the use of a conditional 
mutant of a gene as described above or a fragment, derivative or ortholog thereof 
or of surrogate ligands against said gene expressed in bacteria to induce a lethal 
phenotype in bacteria and/or for the analysis of said bacteria for surrogate 
markers by comparison of RNA or protein profiles in said bacteria with RNA or 
protein profiles in wild type bacteria, and/or the use of said surrogate markers for 
the identification of antagonists of the essential function of said gene. 
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In another embodiment the present invention relates to a method for identifying or 
isolating a surrogate marker comprising the steps as described in the above- 
recited method of the present invention. 

In a further embodiment the present invention relates to a method for identifying 
or isolating a surrogate marker comprising the steps of 

(a) inducing a Jethal phenotype in bacteria representing a conditional mutant of 
a gene selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, 
yrfl, yggJ, yjeE, yiaO, yrdC, yhbC. ygbP, ybeY, gcpE. kdtB, pfs, ycaJ, 
b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC; and 

(b) analysing said bacteria comparing the RNA or protein profile of said 
bacteria with wild type bacteria. 

. The invention also relates to the above recited genes and polypeptides and 
fragments, derivatives and orthologs thereof. 

These and other embodiments are disclosed and encompassed by the description 
and examples of the present invention. Further literature concerning any one of 
the methods, uses and compounds to be employed in accordance with the 
present invention may be retrieved from public libraries, using for example 
electronic devices. For example the public database "Medline" may be utilized 
which is available on the Internet, for example under 
http://www.ncbi:nlm. nih.gov/PubMed/medline. html. Further databases and 
addresses, such as http://www.ncbi.nlm.nih.gov/, 

http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html, http://ww 
w.tigr.org/, are known to the person skilled in the art and can also be obtained 
using, e.g., http://vww.lycos.com. An overview of patent information in 
biotechnology and a survey of relevant sources of patent information useful for 
retrospective searching and for current awareness is given in Berks, TIBTECH 12 
(1994), 352-364. 

The present invention is further illustrated by reference to the following non- 
limiting examples. 
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Unless stated otherwise in the examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook et al. (1989), 
Molecular Cloning : A Laboratory Manual. Cold Spring Harbor Laboratory Press, 
NY or in Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular 
Biology, Current Protocols. Standard materials and methods for plant molecular 
work are described in Plant Molecular Biology Labfase (1993) by R.D.D. Croy, 
jointly published by BIOS Scientific Publications Ltd. (UK) and Blackwell Scientific 
Publications (UK). 

Brief description of the figures 

Figure 1 : Sequences of the essential bacterial genes identified according to 
the method described in the examples 

Figure 2: PGR strategy and the position of primers used 

Figure 3: Sequence comparison table of essential E.coli genes with proposed 
orthologs from various bacteria. Unfinished genomes are indicated 
by asterisk. Complete genomes were analysed using BlastP2. 
Unfinished genomes were analysed with TBIastN. Orthologous 
sequences can be accessed at the respective WWW links as 
indicated in the footnotes. 

Figure 4; Multiple Sequence -Alignment (MSA) of E. coll gene ygbB with 
orthologs in 5 different bacterial organisms including homology 
score. Similar MSA with similar results have been created for all 22 
essential bacterial genes. 

Example 1 



An automated BLASTP-based genome comparisons to identify E. coli FUN genes 
resulted in the following list of 65 candidate genes which are consen/ed between 
E. coli B. subtilis, H. influenzae, H. pylori, M, tuberculosis, Ch, trachomatis. B. 
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burgdorferi, T. pallidum, S. pneumoniae, S. aureus, E. faecalis. P. aeruginosa, B, 
pertussis and which were further analysed: 



FUN 
Genes 


Gene Bank 
Accession 
Number 


FUN 
Genes 


Gene Bank 
Accession 
Number 


FUN 
Genes 


Gene Bank 
Accession 
Number 


ygbB 


g1789103 


yggs 


g1 789321 


yaeE 


g 1786397 


yhaD 


g 1789512 


yggv 


g 1789324 


yicC 


g 1790075 


yhbU 


g1 789548 


yggw 


g1789325 


yebK 


g1788159 


yhiN 


g2367234 


yjhG 


g2367371 


yhbC 


g 1789561 


yieG 


g1 7901 50 


yjiR 


g 1790797 


ygbP 


g1789104 


yihZ 


g 1790320 


yohl 


g 1788462 


ybaX 


g 1786648 


yjgF 


g1790691 


yqhThom 


g 1788728 


yqcD 


91789158 


yacE , 


g 1786292 


yfiH 


g1 788945 


ybeY 


g1 786880 


yaeC 


g 1786396 


yhaR 


g1 789501 


gcpE 


g1 788863 


yagF 


g 1786464 


yhdG 


g1 789660 


kdtB 


g 1790065 


ybeB 


g 1786856 


yccG 


g1787197 


pfs 


g1 786354 


ycfH 


g1 47382 


ychB 


g 1787459 


sms 


g 1790850 


ydcP 


g1 787705 


yejD 


g1 788510 


ycaJ 


g1787119 


ydiB 


g1 787983 


yidD 


g140861. 


yhhF 


g1 789875 


yebl 


g1 788166 


yrfl 


g 1789804 


yleA 


g1 786882 


yeeC 


g1 788320 


yggJ 


g1789315 


b1808 


g1788110 
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FUN 
Genes 


Gene Bank 
Accession 
Number 


FUN 
Genes 


Gene Bank 
Accession 
Number 


FUN 
Genes 


Gene Bank 
Accession 
Number 


yegQ 


g1 788397 


yjeE 


g1 790610 


yeaA 


g 1788077 


yfcB 


g1 788670 


yiaO 


g 1790004 


b1675 


9 1787964 


yfgB 


g1 788865 


yrdC 


g2367210 


yhbU/yegQ 

a 


g 1789548/ 
91788397 


yfhC 


g 1788911 


b1983 


g1788294 


yjgF/yhaR^ 


g 1790691 / 
g 1789501 


ydiD 


g1 787993 


yeeS 


g1 736671 


b2385 


g 1788728 


nlpA 


g72589 


yaaJ 


g1786188 


yicO 


g 1790097 


yfjY 


g 1788997 


ydhE 


g 1742737 


yebC 


g140614 


ykfG 


g2367100 


yjcD 


9396399 


yolil/yhdG^ 


g 1788462/ 
g 1789660 


ygcA 


g1789148 


yceG 


g1 787339 


smpB 


91788973 


ygfA 


g1 789278 


yjbC 


9396357 







^: double mutants were created when the respective genes were paraiogues in 
E.coli 



Creating in-frame deletions of £. colt genes 

The subsequent description of the construction of deletion mutants was 
carried out essentially equal for these 77 candidate genes. Particular details will 
exemplarily be described for one gene which gave rise to be essential (yfhC) and 
one which was non-essential (yggV). 
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1) Principle of the PCR-procedure and primer-design for in frame deletions: 

Unless an overlapping ORF exists, primers dgenX2 and dgenX3 are 
designed to delete the entire ORF from ATG to STOP, e.g.: 
ATGgttataaatttggagtgtgaaggttattgcgtgTAA (SEQ ID NO: 1) (see figure). The Sp- 
ends of primers dgenXI and dgenX4 contain random nucleotides followed 
preferably by a BamHl site (dgenXI) or a Sail site (dgenX4) for cloning into 
plasmid pK03 (Link et al (1997), J Bac 179: 6228-6237). In most mutants, primers 
dgenX2 and dgenX3 contain a 33 bp tag sequence called "Church-tag". 

Church-tag forward direction: 5'-gttataaatttggagtgtgaaggttattgcgtg-3' (SEQ ID 
NO: 2) 

Church-tag reverse direction: 5'-cacgcaataaccttcacactccaaatttataac-3' (SEQ ID NO: 
3) 



This tag is used for a subsequent PGR in which the 5'- and 3'- flanking DNA- 
fragments of the deletion construct are assembled. 



In the few constructs lacking the "Church-tag", the primers dgenX2 and dgenX3 
carry at their 5 -ends 5 random nucleotides followed by a restriction site 
(preferably EcoRI) which by its positioning creates the in frame deletion. 

Oligos cgenXI and cgenX2 are used for the verification of the chromosomal 
situation (wild type or deletion) after the replacement procedure (Fig. 2). 

Primers for the respective candidate genes were designed as follows: 
dyfhCI; 5'-GATCGGATCCAAATTCCAGTTAGCCATGATGCGGTC-3' 
(SEQ ID NO: 4) 

dyfhC2: S'-CACGCAATAACCTTCACACTCCAAATTTATAACCATTATA 

CACGGACGCTATGC-3' (SEQ ID NO: 5) 
dyfhC3: 5 - 

GTTATAAATTTGGAGTGTGAAGGTTATTGCGTGACGGATTAATT 

TTGTTTCTCTT-3' (SEQ ID NO: 6) 
dyfhC4: 5'-GATCGTCGACGCGCTCGATATCACCGATGAACAACCG-3' 

(SEQ ID NO: 7) 

cyfhCI : 5*-CAATCCGCTGCTTTATTTCTGTCAG-3* (SEQ ID NO: 8) 
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cyfhC2: 5'-TTATAACGAAATCAACGGGAAACCT-3' (SEQ ID NO: 9) 



dyggVI : 5'-GATCGGATCCCTCTAAAAAATAAGGAATTAAAGG-3' 
(SEQ ID NO: 10) 

dyggV2: 5'-CACGCAATAACCTTCACACTCCAAATTTATAACCATAGGATAC 

CTAATTAATTAAC-3' (SEQ ID NO: 11 ) 
dyggVS: 5'-GTTATAAATTTGGAGTGTGAAGGTTATTGCGTGAAGAGCGCC 

ATTTCCCACCGT-3' (SEQ ID NO: 12) 
dyggV4: 5'-GATCGTCGACTCATATTGCTGATAACCCGCTGCGGT-3' 

(SEQ ID NO: 13) 

cyggVI : 5'-GTTGACGGCCAGGCCAACAGTCAT-3' (SEQ ID NO: 14) 
cyggV2: 5'-ATAACCCTGGGCAATCGCCTCG-3' (SEQ ID NO: 15) 



Example 2 

Construction of the DNA-fragments comprising the deletion 

The 5'- and the 3'-flanking DNA fragments are PGR amplified in a total 
volume of 50 pi as follows: 

Chromosomal DNA from E. coli strain MG1655 (100 ng/^\): 

final cone: 1 ng/pl 
final cone.: 1x 
final cone: 500 nM 
final cone: 500 nM 
final cone.: 5 U/100 pi 
final cone: 250 \iM 
to adjust volume to 50 \il 

PCR conditions: 

4' 94 °C 



10*Pwo-buffer 
dgenX1/3 (10^M) 
dgenX2(4)(10uM) 
Pwo-Polymerase 
dNTPs (25 mM) 
H2O 
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30 cycles: 30'* 94 °C. 30" 44 °C, V 72 °C 
5'72°C 

The PGR products are then purified with the High Pure PGR Purification Kit 
(Boehringer) to remove salts and enzyme (elute in 50 ul H2O). Altematively, if 
PGR products contain prominent impurities, the respective fragment must be 
purified by agarose gel extraction (Gene Clean, Dianova) before the fragment 
assembly. 

Assembly PGR 

Equal amounts of 5'- and 3*-fragment are applied as template DNA. In general a 
volume applied for gel electrophoresis giving an intense band is o.k. The total 
reaction volume is 100 pi. For the assembly the "outer" primers dgenXI and 
dgenX4 were used. 



5'-Fragment 
3 -Fragment 
10*Pwo-buffer 
dgenXI (10 ^iM) 
dgenX4 (10 ^M) 
Pwo-Pol (Boehringer) 
dNTPs (25mM) 
H2O 



approx. 10 ng 
approx. 10 ng 
final cone: 1x 

final cone: 500 nM (50 pmol/100 pi) 
final cone: 500 nM (50 pmol/100 pi) 
final cone: 5 U/100 pi 
final cone: 250 pM 
add to 100 ^1 



PCR conditions: 

4^ 94 °G 

10 cycles: 30" 94 °G. 30" 44 °G, r 72 °G 
25 cycles: 30" 94 °G, 30" 44 "G, 3' 72 °G 
5' 72 ^C 



The success of the PGR is checked by agarose gel electrophoresis. The 
assembled PCR product is purified with the High Pure PGR Purification Kit and 
the complete eluate of 50 ul is over-night digested with BamHI and Sail in a 
volume of 60 ul. After gel electrophoresis the digested product is purified with 
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3) Plasmid-DNA from 4 ml over-night culture is prepared using a QIAgen 
Miniprep Kit and a double restriction analysis with BamHI/Sall and 
EcoRI/Hindlll is performed to verify the clones. 

Protocol referring to the construction of assembly products by a 
restriction site: 

The 5'- and the 3'-fragments are PGR amplified as described above. The PGR 
products are purified with the High Pure PGR Purification Kit (Boehringer) to 
remove salts and enzyme and 5 to 10 }il are digested over night using the 
restriction site creating the deletion (primers 2 and 3; mostly EcoRI) in a total 
volume of 30 |jl. The restriction products are again purified with the High Pure 
PGR Purification Kit to remove nucleotides, salts and enzyme. (Alternatively: 
Following preparative agarose gel electrophoresis the cut fragments are isolated 
using Gene Glean (Dianova) and eluted in a volume of 25 ^1. The cut fragments 
(3-6 }al each) are ligated in a volume of 10-15 ^1 using T4-DNA ligase for 2 hours 
at room temperature, 5 }il of this ligation mix is directly used as a template for a 
second PGR. In this PGR, the assembled fragments are amplified using primers 
dgenXI and dgenX4. The reaction is set up as described above with two 
exceptions: 1) The total reaction volume is 100 pi and 2) the extension step at 72 
°G lasts 3'. 

Example 3 

The chromosomal exchange strategy 

(Link et al (1997), J Bac 179; 6228-6237) 

Cointegration: 

Gointegration = integration of a plasmid into the chromosome by a recombination 
event 

The pK03 derivative is transformed into MG1655 or any recA+ strain 
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Day 1 

The strain is grown at 30 °C in LB containing 20 ug/ml chloramphenicol (LB- 
Cam20) to an ODeoo of -1.0. Afterwards, perform 10-fold serial dilutions in the 
same medium (down to 10'^). For the following plating use prewarmed LB-Cam20 
agar plates. Plate 100 |il of dilutions 10"^ and 10*^ for incubation at 44 °C and 100 
^1 of dilutions 10*^ and 10'^ for incubation at 30 

Day 2 

Following incubation at the respective temperature, determine the factor 
c.f.u.44 °C/c.f.u.30 °C (c.f.u. = colony forming units). This factor for pK03 without 
insert is in the range 1*10""^ to 5*10"^ and should be significantly larger in the case 
of successful cointegration. Purify 8 randomly chosen clones from the 44 °C plate 
twice on LB-Cam20 agar plates at 44 °C (during Day 2 and over night to Day 3). 
Optionally, confirm the clones for their identity as cointegrates by colony-PCR. 

Resolution and counter-selection: 

Resolution = resolution of the cointegrate resulting in a self repiicative plasmid by 
a second recombination event 

Counter-selection = selection against the presence of plasmid in the cell 
Day 3 

Pool single colonies from each of the 8 cointegrates in 100 )il LB and use this 
suspension as an inoculum for 10 ml LB+5 %sucrose. After growth at 30 °C (8 to 
10 hours during a day is sufficient) 10-fold serial dilutions are performed and 100 
111 of dilutions 10^, 10*^ and 10"^ are plated onto LB agar+5 % sucrose and grown 
over night at 30 °C. 

Day 4 

50 single colonies are replica streaked on LB+Cam20 and LB+5 % sucrose to test 
for the loss of plasmid. 
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Example 4 

Testing for essentiality of FUN genes of E. coli and interpretation of the 

results 

Day 5 

The clones sensitive to chloramphenicol are then tested for their genotype (wild 
type versus in-frame deletion) by colony-PCR using primers cgenXI and cgenX2 
(10-48 clones). 

In the case of the gene yfhC out of 48 clones tested only wild type situation on the 
chromosome could be detected. 

In the case of the gene yggV out of 48 clones 16 {= 33 %) revealed a PGR 
product with a size indicative for the deletion situation on the chromosome. 

Are 48 clones revealing no mutant enough to claim a gene as essential? This 
question can be answered by asking for the number of clones that have to be 
tested to get a confidence of e.g. 99 % that really no mutants are present in an 
infinite number of clones. Provided a hypothesis Ho means that only the wild type 
genotype is viable and hypothesis Hi means that a fraction (1-x) of mutants is 
allowed to occur together with the wild type (x) among a population of clones (x + 
(1-x)), then the probability to make the wrong decision (decision for Ho whereas Hi 
is true) can be calculated as 

(1) , x"/(1+x") 

where x is the fraction of wild type clones and n is the number of clones tested. 
The confidence niveau a to make the wrong decision (error probability) is given by 

(2) a>x"/(1+x") 
thereby resulting in 

(3) n>ln(a/(1-a))/ln(x) 

for the number of clones that have to be tested to prove or disprove hypothesis 
Ho. 
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If the average probability for obtaining wild type clones (x) in a replacement 
experiment is 70 % (experimentally determined for 43 non-essential genes out of 
65 candidate genes), then, after testing of 26 clones which reveal a wild type 
genotype an uncertainty of 0.01 % error probability (a) remains that the claiming 
of a gene as essential could be wrong. Even if the rate of obtaining wild types (x) 
is set to 85 % (a value which occurs with a frequency of 10 % for replacement 
experiments with non-essential genes), then, by testing 32 clones (which was 
performed in every experiment giving rise to an essential gene) an error 
probability of only 0.6 % remains to chose the wrong hypothesis. 



Examie 5 

List of essential FUN genes obtained 

By the described method the following 24 genes were obtained which gave no 
deletion genotype and are therefore claimed^to be essential: 



£. coli 


gene name GenBank# 


ygbB 


g1789103 


yfhC 


g 1788911 


yacE 


g 1786292 


ychB 


g1 787459 


yejD 


g 1788510 


yrfl 


g 1789804 


yggJ 


g 178931 5 


yjeE 


g 1790610 


yiaO 


g 1790004 


yrdC 


g2367210 


yhbC 


g 1789561 


ygbP 


g 17891 04 


ybeY 


g 1786880 
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y I / OOODo 


krItR 


y 1 / yuuDO 




g 1 ^ ODO04 


\/r*a 1 
yud J 


g 1 /d M I y 


U 1 OUO 


gi /^ooi ID 


yeaA 


g1 788077 


yagF 


g 1786464 


b1983 


g 1788294 


yidD 


g140861 


yceG 


g 1787339 


yjbc 


g3g6357 
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CLAIMS 

A method for identifying an antagonist or inhibitor of the expression of a 
gene encoding a polypeptide essential for bacterial growth or survival 
wherein said gene is selected from the group consisting of ygbB, yfhC, 
yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO. yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA. yagF, b1983, yidD, yceG and/or yjbC, the 
sequence of said genes being shown in Fig. 1, or a fragment, derivative or 
orthoiog thereof, said method comprising the steps of 

(a) testing a candidate antagonist or inhibitor or a sample comprising 
a plurality of said candidate antagonists or inhibitors for the 
inhibition or reduction of transcription of said gene or a fragment 
or derivative thereof; or 

(b) testing a candidate antagonist or inhibitor or a sample comprising 
a plurality of said candidate antagonists or inhibitors for the 
inhibition or reduction of translation of mRNA transcribed from 
said gene or a fragment or derivative thereof; and 

(c) identifying an antagonist or inhibitor or a sample comprising a 
plurality of said candidate antagonists or inhibitors that tests 
positive in step (a) and/or (b). 

A method for testing a candidate antagonist or inhibitor of a polypeptide or 
a mRNA essential for bacterial growth or survival encoded by a gene 
selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl. 
yggJ, yjeE, yiaO, yrdC, yhbC, ygbP. ybeY, gcpE, kdtB, pfs, ycaJ, b1808, 
yeaA, yagF, b1983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or a fragment, derivative or orthoiog thereof 
comprising the steps of 

(a) contacting a bacterial cell with a candidate antagonist or inhibitor 
or a sample comprising a plurality of said candidate antagonists 
or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition 
and/or cell death. 
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3. A method for testing a candidate antagonist or inhibitor of the function of a 
gene essential for bacterial growth or survival wherein said gene is 
selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, yrfi, 
yggJ. yjeE, yiaO, yrdC. yhbC, ygbP, ybeY, gcpE, kdtB, pfs. ycaJ, b1808. 
yeaA, yagF, b1983, yidD. yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or a fragment, derivative or ortholog thereof, 
comprising the steps of 

(a) contacting a bacterial cell comprising said gene with a candidate 
antagonist or inhibitor or a sample comprising a plurality of said 
candidate antagonists or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition 
and/or cell death. 

4. The method of any one of claims 1 to 3 further comprising identifying an 
antagonist or inhibitor, optionally from said sample of candidate 
antagonists or inhibitors. 

5. The method of any one of claims 1 to 4 wherein said inhibitor or 
antagonist is further improved by peptidomimetics or by applying phage 
display or combinatorial library technique step(s). 

6. A method for designing an improved antagonist or inhibitor for the 
treatment of a bacterial infection or disorder or disease related to a 
bacterial infection comprising the steps 

(a) identification of the binding site of an antagonist or inhibitor to the 
polypeptide ygbB, yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, 
yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA. 
yagF, b1 983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or obtained by or identified by the method 
of any one of claims 1 to 5 by site-directed mutagenesis and 
chimeric polypeptide studies; 

(b) molecular modeling of both the binding site of said antagonist or 
inhibitor and the stnjcture of said polypeptide; and 
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modification of said antagonist or inhibitor to improve its binding 
specificity or affinity for the polypeptide. 

An antagonist or inhibitor of the activity of a polypeptide encoded by a 
gene selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, 
yrfl, yggJ. yjeE, ylaO, yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, 
b1808. yeaA, yagF, b1983, yidD. yceG and/or yjbC. the sequence of said 
genes being show/n in Fig. 1 , or fragment, derivative or ortholog thereof or 
of the expression of a gene encoding said polypeptide or said fragment, 
derivative or ortholog or obtained by or identified by the method of any 
one of claims 1 to 6. . 

A method for producing a therapeutic agent comprising synthesizing the 
antagonist or inhibitor identified, tested or designed according to the 
method of any one of claims 1 to 6 or the antagonist or inhibitor of claim 7 
or an analog or derivative thereof. 

A method for producing a composition comprising the steps of the method 
of any one of claims 1 to 6 or synthesizing the antagonist or inhibitor pf 
claim 7 and formulating said inhibitor or antagonist in a pharmaceutically 
acceptable form. 

A composition comprising an antagonist or inhibitor of claim 7, the 
therapeutic agent produced by the method of claim 8 or the antagonist or 
inhibitor obtained by or identified in the method of any one of claims 1 to 6 
or produced according to claim 9 and optionally a pharmaceutically 
acceptable carrier. 

The composition of claim 10 which is a pharmaceutical composition. 
The composition of claim 10 which is a kit. 
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The composition of any one of claims 10 to 12 further comprising an 
antibiotic and/or cytokine. 
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14. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychB, yejD. yrfl, yggj, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs. ycaJ. b1808, yeaA, yagF, b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown In Fig. 
1 , or a fragment, derivative or ortholog thereof or of any of said genes for 
the identification of an antagonist or inhibitor of the activity of said 
polypeptide or said fragment, derivative or ortholog or of the expression of 
a gene encoding said polypeptide or said fragment, derivative or ortholog. 

15. Use of an antagonist or inhibitor of claim 7, the therapeutic agent 
produced by the method of claim 8 or the antagonist or inhibitor obtained 
by or identified in the. method of any one of claims 1 to 6 or produced 
according to claim 9 or identified by the use of any of the claims for the 
preparation of a pharmaceutical composition for the treatment of (a) 
bacterial infection{s), disorder(s) and/or disease(s) related to bacterial 
infections. 

16. A method for treating or preventing bacterial infections or diseases or 
disorders related to bacterial infections comprising the step of 
administering to a subject in need thereof the antagonist or inhibitor 
obtained by or identified in the method of any one of claims 1 to 6 or 
produced according to claim 9 optionally comprised in the pharmaceutical 
composition according to claim 1 1. 

17. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychS, yejD, yrfl. yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ. b1808, yeaA, yagF, b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1, or a fragment, derivative or ortholog thereof or any of said genes for 
screening for polypeptides interacting with said polypeptide using protein- 
protein interaction technologies, and/or for validating such interaction as 
being essential for bacterial survival and/or for screening for antagonists 
or inhibitors of such interaction. 



wo 00/61793 



39 



PCT/EPOO/03135 



18. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY. gcpE. kdtB, pfs, ycaJ, b1808, yeaA, yagF. b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1, or a fragment, derivative or ortholog thereof or any of said genes for 
screening of polypeptide for polypeptide binding to said polypeptide, 
and/or for validating the peptides binding to said polypeptide as 
preventing growth of bacteria or being lethal to bacteria upon expression 
of said polypeptides in said bacteria, and/or for screening for small 
molecules competitively displacing said peptides. 

1 9. Use of conditional mutants in a gene selected from the group consisting of 
ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, 
ybeY. gcpE, kdtB, pfs, yea J, b1808, yeaA, yagF, b1983. yidD, yceG 
and/or yjbC, the sequence of said genes being shown in Fig., 1, or a 
fragment, derivative or ortholog thereof or of surrogate ligands against 
said gene expressed in bacteria to induce a lethal phenotype in bacteria 
and/or for the analysis of said bacteria for surrogate markers by 
comparison of RNA or protein profiles in said bacteria with RNA or protein 
profiles in wild type bacteria, and/or the use of said surrogate markers for 
the identification of antagonists of the essential function of said gene. 

20. A method for identifying or isolating a surrogate marker comprising the 
steps as described in claim 19. 

21. A method for identifying or isolating a surrogate marker comprising the 
steps of 

(a) inducing a lethal phenotype in bacteria containing a conditional 
mutant of a gene selected from the group consisting of ygbB, yfhC, 
yacE, ychB, yejD. yrfl, yggJ, yjeE, yiaO, yrdC, yhbC. ygbP. ybeY, 
gcpE, kdtB, pfs, ycaJ, b1808, yeaA. yagF, b1983, yidD, yceG and/or 
yjbC, the sequence of said genes being shown in Fig. 1; and 
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(b) analysing said bacteria comparing the RNA or protein profile of said 
bacteria with wild type bacteria. 



It" 
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1/15 
Fig. 1 

1) ygbB 

atgcgaattggacacggttttgacgtacatgcctttggcggtgaaggcccaattatcattggtggcgtacgcattccttacgaaaa 

aggattgctggcgcattctgatggcgacgtggcgctccatgcgttgaccgatgcattgcttggcgcggcggcgctgggggatat 

cggcaagctgttcccggataccgatccggcatttaaaggtgccgatagccgcgagctgctacgcgaagcctggcgtcgtattc 

aggcgaagggttatacccttggcaacgtcgatgtcactatcatcgctcaggcaccgaagatgttgccgcacattccacaaatg 

cgcgtgtttattgccgaagatctcggctgccatatggatgatgttaacgtgaaagccactactacggaaaaactgggatttaccg 

gacgtggggaagggattgcctgtgaagcggtggcgctactcattaaggcaacaaaatga (SEQ ID NO: 16) 

2) yfhC 

atgcgccgcgcttttataaccggagttttctttttgtctgaagtcgaatttagccacgaatactggatgcgtcacgcgctgacgctg 

gcgaaacgtgcctgggatgagcgggaagtgccggtcggcgcggtattagtgcataacaatcgggtaatcggcgaaggctg 

gaaccgcccgattggtcgccatgatcccaccgcacatgcagaaatcatggccctgcggcagggtggtctggtgatgcaaaat 

tatcgtctgatcgacgccacgttgtatgtcacgcttgaaccatgtgtaatgtgtgccggagcgatgatccacagtcgcattggtcg 

cgtggtctttggtgcgcgtgacgcgaaaactggcgctgcgggatctttaatggatgtgctgcatcatccgggtatgaatcaccga 

gtggaaattacggaaggaatactggcggatgagtgcgcggcgttgctcagtgacttctttcgcatgcgccgccaggaaattaa 

agcgcagaaaaaagcgcaatcctcgacggattaa (SEQ ID NO: 17) 

3) yacE 

atgaggtatatagttgccttaacgggaggcattggcagtggcaagagtaccgttgccaatgcgtttgctgatctcggaattaacg 

tcattgatgccgatattattgcgcgtcaggtggttgaaccaggtgcacctgcgctacatgccattgctgatcactttggcgctaaca 

tgattgctgctgatggaacattgcagcgccgggccttgcgcgagcggatcttcgccaacccggaagagaaaaactggcttaa 

cgccctgctgcatccgctgattcagcaagagacgcaacaccagatccagcaagctacttccccctatgtactgtgggttgtgcc 

attgctggtagaaaactcactgtataaaaaagcgaatcgagtgcttgtggtggatgtcagcccagaaacgcaacttaagcgc 

accatgcagcgcgatgatgtaactcgcgagcatgtcgaacaaatccttgctgctcaggcaacgcgcgaagcccgccttgccg 

tggcagatgacgtcattgataataacggcgcaccggatgctatcgcatcggatgttgcccgcctgcacgcacactatttgcagc 

ttgcgtcgcagtttgtctcacaggaaaaaccgtaa (SEQ ID NO: 18) 

4) ychB 

atgcggacacagtggccctctccggcaaaacttaatctgtttttatacattaccggtcagcgtgcggatggttaccacacgctgc 

aaacgctgtttcagtttcttgattacggcgacaccatcagcattgagcttcgtgacgatggggatattcgtctgttaacgcccgttg 

aaggcgtggaacatgaagataacctgatcgttcgcgcagcgcgattgttgatgaaaactgcggcagacagcgggcgtcttcc 

gacgggaagcggtgcgaatatcagcattgacaagcgtttgccgatgggcggcggtctcggcggtggttcatccaatgccgcg 

acggtcctggtggcattaaatcatctctggcaatgcgggctaagcatggatgagctggcggaaatggggctgacgctgggcg 

cagatgttcctgtctttgttcgggggcatgccgcgtttgccgaaggcgttggtgaaatactaacgccggtggatccgccagaga 

agtggtatctggtggcgcaccctggtgtaagtattccgactccggtgatttttaaagatcctgaactcccgcgcaatacgccaaa 

aaggtcaatagaaacgttgctaaaatgtgaattcagcaatgattgcgaggttatcgcaagaaaacgttttcgcgaggttgatgc 

ggtgctttcctggctgttagaatacgccccgtcgcgcctgactgggacaggggcctgtgtctttgctgaatttgatacagagt^^ 

aagcccgccaggtgctagagcaagccccggaatggctcaatggctttgtggcgaaaggcgctaatctttccccattgcacaq 
agccatgcttlaa (SEQ ID NO: 19) 

5) yejD 

atgcgacttgataaatttatcgcacagcaactcggcgttagccgtgctattgccgggcgtgaaatccgcggcaatcgtgtcacc 

gtcgatggcgaaatcgtccgtaatgcagcgttcaaactgcttcctgaacatgatgtcgcttacgatggcaacccgctggcgcag 

caacacggtccacgttacttcatgctcaataagcctcagggctatgtttgctccacggacgaccctgatcacccaacggtgctct 

attttcttgatgaaccggtagcgtggaaactgcatgcggcggggcggttggatattgataccaccggtctggtgctgatgactga 

tgatggtcagtggtcgcaccgcattacttctccgcgccatcattgcgagaagacctatctggtgacactggaatcacctgtagct 

gacgatacggcagagcaatttgctaaaggcgtgcagctgcataacgaaaaagatctcactaagcctgcggtgctggaagtg 

attaccccaacgcaggttcgtctgaccatcagcgaagggcgttatcatcaggtgaaacgcatgttcgccgccgtgggtaacca 

cgtggttgagctgcatcgtgaacgtattggcggtattacgctggatgctgatttagcccccggtgaatatcgtccgttaactgaag 

aagaaattgccagcgtcgtctaa (SEQ ID NO: 20) 
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6) yrf I 

atgattatgccgcaacatgaccaattacatcgctatctgtttgaaaactttgccgtgcgcggcgaactggtaaccgtttcggaaa 

ccctgcaacagatccttgagaaccacgattatccgcagcccgttaaaaacgtgctggcagaactgctggttgcgaccagcct 

gttaaccgctacgctgaagtttgatggtgatatcaccgtacagctgcagggcgacggtccgatgaatctggcggttattaacggt 

aacaataaccagcagatgcgcggtgtggcgcgcgtgcagggcgaaattccagaaaatgccgacctgaaaacgctggtcg 

gcaatggttacgtggtgatcaccattaccccgagcgaaggcgaacgctatcagggcgtagttggtctggaaggtgataccctg 

gcggcctgcctggaagattactttatgcgttctgaacagctgccgacgcgcctgtttattcgcaccggcgacgtagacggcaaa 

ccggctgcaggcggtatgttgttgcaggtaatgcctgcgcaaaatgcccagcaggacgactttgaccacctggcgacgctaa 

ccgaaaccatcaaaaccgaagaactgctgaccttaccggcaaacgaagtgttgtggcgtttgtatcacgaagaagaggtga 

cggtttacgatccgcaggatgtggagttcaaatgcacctgctcgcgtgaacgttgcgccgatgcgctgaaaacgctgcctgatg 

aagaagttgatagcatcctggcggaagatggcgaaattgacatgcattgtgattactgcggtaaccactatctgttcaatgcgat 

ggatattgctgaaatccgcaacaacgcgtctccggcagatccgcaagttcattaa (SEQ ID NO: 21) 

7) yggJ 

gtggggagacgacgcggatttttaactatgcgtatcccccgcatttatcatcctgaaccactgaccagccattctcacatcgcgct 

ttgcgaagatgccgccaaccatatcgggcgcgtactgcgcatggggccggggcaggcgttgcaattgtttgacggtagcaac 

caggtctttgacgccgaaattaccagcgccagcaaaaaaagcgtggaagtgaaggtgctggaaggccagatcgacgatcg 

cgaatctccgctgcatattcacctcggtcaggtgatgtcgcgtggtgaaaaaatggaatttactatccagaaatcgatcgaactc 

ggtgtaagcctcattacgccacttttttctgagcgctgcggcgttaaactggatagtgaacgtctgaacaagaagcttcagcagt 

ggcagaagattgcaattgctgcctgtgagcagtgtggtcgtaaccgggtgccggaaatccgtccagcgatggatctggaagc 

ctggtgtgcagagcaggatgaaggactgaaactgaatcttcacccgcgcgccagtaacagcatcaatacgttgccgttaccg 

gttgaacgcgtccgcctgctgattggcccggaaggcggtttatcggcagatgaaattgccatgactgcccgctatcaatttactg 

atatcctgttgggacctcgcgttttgcgtacagagacaactgcgctcaccgccattaccgcgctacaagtacgatttggcgatttg 

ggctaa (SEQ iD NO: 22) 

8) yjeE 

atgatgaatcgagtaattccgctccctgatgagcaggcaacattagacctgggcgagcgggtagcgaaagcctgcgatggc 

gcaaccgtaatctatctgtatggcgatttaggcgcaggtaaaaccacctttagccggggctttttacaggctctgggtcatcagg 

gtaatgtcaaaagccccacttatacgctggtcgaaccctatacgctcgacaacttaatggtctatcactttgatttgtaccgccttg 

ccgatcccgaggagctggagtttatggggatccgcgattattttgccaacgatgccatctgcctggtggagtggccacaacaa 

ggtacaggtgttcttcctgacccggatgtcgaaatacacattgattatcaggcacaaggccgtgaggcgcgcgtgagtgcggtt 

tcctctgcgggtgaattgttgctggcgcgtttagccggttaa 

(SEQ ID NO: 23) 

9) yiaO 

atgaaattacgctctgtaacctacgcattattcattgctggcctggctgcattcagcacatcttctctggcggcacaatctttacgttt 

cggttatgaaacatcacaaaccgactcgcaacatattgcggcgaaaaaattcaatgatttattgcaggagagaaccaaaggc 

gagctgaaattaaaactgttcccggacagcactctcggtaacgcgcaggcgatgatcagcggcgtacgtggcggcaccatc 

gatatggaaatgtccggctcgaataactttgccgggttatcaccagtgatgaacttgcttgatgtccctttcctgttccgcgataccg 

ctcacgcgcataaaacgctcgacggcaaagtcggtgatgatctgaaagcctcacttgaaggtaaaggactgaaagtactgg 

cctactgggaaaacggctggcgcgatgtcaccaactcgcgcgGaccggttaaaacccccgccgacctgaaagggctgaaa 

atccgcaccaacaatagcccgatgaatatcgccgcattcaaagtctttggcgctaacccgatcccgatgccgtttgccgaagtc 

tataccgggctggaaacccgcactatcgacgctcaggaacacccgatcaacgtcgtctggtcagcaaaatttttcgaagtgca 

gaagttcctttctctgacgcaccacgcctattccccgcttctggtggtgatcaacaaagcgaagtttgatggcttaagtccggagtt 

ccagcaggcgctagtttcatctgcacaagaagcgggtaactatcagcgcaaactggttgctgaagatcagcaaaaaatcatc 

gacggcatgaaagaagcgggcgtggaagtcatcaccgatctcgaccgcaaagcctttagcgacgcactggggaatcaggt 

tcgcgacatgtttgttaaagatgtgccgcagggagctgatctgctgaaagccgtggatgaggtgcaataa (SEQ ID NO: 

24) 

10) yrdC 

gtgaataataacctgcaaagagacgctatcgcagctgcgatagatgttctcaatgaagaacgtgtcatcgcctatccaacgga' 
■ agccgttttGggtgttgggtgcgatcctgatagcgaaacagcagtgatgcgactgttggagttaaaacagcgtccggttgataa 
ggggctgattttaatcgcagcaaattacgagcagcttaaaccctatattgatgacaccatgttgactgacgtgcagcgtgaaac 
cattttttcccgctggccaggtcctgtcacctttgtctttcccgcgcctgcgacaacaccgcgctggttgacgggccgctttgattcg 
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cttgctgtacgagtcaccgaccatccgttggtggttgctttgtgccaggcttatggtaaaccgctggtttctaccagtgccaacttga 
gtggattgccaccttgtcgaacagtagacgaagttcgcgcacaatttggcgcggcgttcccggttgtgcctggtgaaacgggg 
gggcgtttaaatccttcagaaatccgcgatgccctgacgggtgaactgtttcgacaggggtaa (SEQ ID NO: 25) 

11) yhbC 

gtgggcttgtccacattagagcaaaaattaacagagatgattactgcgccagttgaggccctgggttttgaactggttggcatcg 

aatttattcgcggtcgcacatccacactgcgcatctatattgatagtgaagatggcatcaatgttgatgattgtgctgatgtgagcc 

accaggtaagtgctgtgctggatgttgaagatcccatcaccgttgcttataacctggaagtctcctcaccgggtctcgatcgccc 

actgttcacggctgaacactacgcccgttttgtcggagaagaggtgactctggttctccgtatggcggtacaaaaccgtcgtaa 

atggcagggcgttatcaaagcggtagacggtgaaatgatcacagttaccgtcgaaggtaaagatgaagtgttcgcgctgagt 

aatatccagaaggcgaacctggttccccacttttaa (SEQ ID NO: 26) 

12) ygbP 

atggcaaccactcatttggatgtttgcgccgtggttccggcggccggatttggccgtcgaatgcaaacggaatgtcctaagcaa 

tatctctcaatcggtaatcaaaccattcttgaacactcggtgcatgcgctgctggcgcatccccgggtgaaacgtgtcgtcattgc 

cataagtcctggcgatagccgttttgcacaacttcctctggcgaatcatccgcaaatcaccgttgtagatggcggtgatgagcgt 

gccgattccgtgctggcaggtctgaaagccgctggcgacgcgcagtgggtattggtgcatgacgccgctcgtccttgtttgcatc 

aggatgacctcgcgcgattgttggcgttgagcgaaaccagccgcacgggggggatcctcgccgcaccagtgcgcgatacta 

tgaaacgtgccgaaccgggcaaaaatgccattgctcataccgttgatcgcaacggcttatggcacgcgctgacgccgcaattt 

ttccctcgtgagctgttacatgactgtctgacgcgcgctctaaatgaaggcgcgactattaccgacgaagcctcggcgctggaa 

tattgcggattccatcctcagttggtcgaaggccgtgcggataacattaaagtcacgcgcccggaagatttggcactggccga 

gttttacctcacccgaaccatccatcaggagaatacataa (SEQ ID NO: 27) 

13) ybeY 

atgagtcaggtgatcctcgatttacaactggcatgtgaagataattccgggttaccggaagagagccagtttcagacatggctg 
aatgcggtgatcccgcagtttcaggaagaatcggaagtgacgattcgcgtggtcgataccgccgaaagccacagtctgaatc 
tgacctatcgcggtaaggataagccgaccaacgtgctctccttcccgtttgaagtgccgcctggcatggaaatgtcgctactgg 
gcgatctggttatctgccgtcaggtggttgagaaggaagctcaggagcaaggcaaaccactggaggcgcactgggcgcata 
tggtggtgcacggcagtctgcatttgttaggttacgatcacatcgaagatgacgaagcagaagaaatggaagccctcgaaac 
agagattatgcttgctctgggctatgaggatccgtacattgccgagaaagaataa (SEQ ID NO: 28) 

14) gcpE 

atgcataaccaggctccaattcaacgtagaaaatcaacacgtatttacgttgggaatgtgccgattggcgatggtgctcccatc 

gccgtacagtccatgaccaatacgcgtacgacagacgtcgaagcaacggtcaatcaaatcaaggcgctggaacgcgttgg 

cgctgatatcgtccgtgtatccgtaccgacgatggacgcggcagaagcgttcaaactcatcaaacagcaggttaacgtgccg 

ctggtggctgacatccacttcgactatcgcattgcgctgaaagtagcggaatacggcgtcgattgtctgcgtattaaccctggca 

atatcggtaatgaagagcgtattcgcatggtggttgactgtgcgcgcgataaaaacattccgatccgtattggcgttaacgccgg 

atcgctggaaaaagatctgcaagaaaagtatggcgaaccgacgccgcaggcgttgctggaatctgccatgcgtcatgttgat 

catctcgatcgcctgaacttcgatcagttcaaagtcagcgtgaaagcgtctgacgtcttcctcgctgttgagtcttatcgtttgctgg 

caaaacagatcgatcagccgttgcatctggggatcaccgaagccggtggtgcgcgcagcggggcagtaaaatccgccattg 

gtttaggtctgctgctgtctgaaggcatcggcgacacgctgcgcgtatcgctggcggccgatccggtcgaagagatcaaagtc 

ggtttcgatattttgaaatcgctgcgtatccgttcgcgagggatcaacttcatcgcctgcccgacctgttcgcgtcaggaatttgatg 

ttatcggtacggttaacgcgctggagcaacgcctggaagatatcatcactccgatggacgtttcgattatcggctgcgtggtgaa 

tggcccaggtgaggcgctggtttctacactcggcgtcaccggcggcaacaagaaaagcggcctctatgaagatggcgtgcg 

caaagaccgtctggacaacaacgatatgatcgaccagctggaagcacgcattcgtgcgaaagccagtcagctggacgaa 

gcgcgtcgaattgacgttcagcaggttgaaaaataa 

(SEQ ID NO: 29) 

15) kdtB 

atgcaaaaacgggcgatttatccgggtactttcgatcccattaccaatggtcatatcgatatcgtgacgcgcgccacgcagatgt 
tcgatcacgttattctggcgattgccgccagccccagtaaaaaaccgatgtttaccctggaagagcgtgtggcactggcacag 
caggcaaccgcgcatctggggaacgtggaagtggtcgggtttagtgatttaatggcgaacttcgcccgtaatcaacacgctac 
ggtgctgattcgtggcctgcgtgcggtggcagattttgaatatgaaatgcagctggcgcatatgaatcgccacttaatgccggaa 
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ctggaaagtgtgtttctgatgccgtcgaaagagtggtcgtttatctcttcatcgttggtgaaagaggtggcgcgccatcagggcg 
atgtcacccatttcctgccggagaatgtccatcaggcgctgatggcgaagttagcgtag (SEQ ID NO: 30) 

16) pfs 

atgaaaatcggcatcattggtgcaatggaagaagaagttacgctgctgcgtgacaaaatcgaaaaccgtcaaactatcagtc 

^cggcggttgcgaaatctataccggccaactgaatggaaccgaggttgcgcttctgaaatcgggcatcggtaaagtcgctgcg 

gcgctgggtgccactttgctgttggaacactgcaagccagatgtgattattaacaccggttctgccggtggcctggcaccaacgt 

tgaaagtgggcgatatcgttgtctcggacgaagcacgttatcacgacgcggatgtcacggcatttggttatgaatacggtcagtt 

accaggctgtccggcaggctttaaagctgacgataaactgatcgctgccgctgaggcctgcattgccgaactgaatcttaacg 

ctgtacgtggcctgattgttagcggcgacgctltcatcaacggttctgttggtctggcgaaaatccgccacaacttcccacaggc 

cattgctgtagagatggaagcgacggcaatcgcccatgtctgccacaatttcaacgtcccgtttgttgtcgtacgcgccatctcc 

gacgtggccgatcaacagtctcatcttagcttcgatgagttcctggctgttgccgctaaacagtccagcctgatggttgagtcac^ 

ggtgcagaaacttgcacatggctaa (SEQ ID NO: 31) 

17) yea J 

gtgagcaatctgtcgctcgatttttcggataatacttttcaacctctggccgcgcgtatgcggccagaaaatttagcacagtatatc 

ggccagcaacatttgctggctgcggggaagccgttgccgcgcgctatcgaagccgggcatttacattctatgatcctctggggg 

ccgccgggtaccggcaaaacaactctcgctgaagtgattgcccgctatgcgaacgctgatgtggaacgtatttctgccgtcac 

ctctggcgtgaaagagattcgcgaggcgatcgagcgcgcccggcaaaaccgcaatgcaggtcgccgcactattctttttgttg - 

acgaagttcaccgtttcaacaaaagccagcaggatgcatttctgccacatattgaagacggcaccatcacttttattggcgcaa 

ccactgaaaacccgtcgtttgagcttaattcggcactgctttcccgtgcccgtgtctatctgttgaaatccctgagtacagaggata 

ttgagcaagtactaactcaggcgatggaagacaaaacccgtggctatggtggtcaggatattgttctgccagatgaaacacg 

acgcgccattgctgaactggtgaatggcgacgcgcgccgggcgttaaatacgctggaaatgatggcggatatggccgaagt 

cgatgatagcggtaagcgggtcctgaagcctgaattactgaccgaaatcgccggtgaacgtagcgcccgctttgataacaaa 

ggcgatcgcttttacgatctgatttccgcactgcataagtcggtacgtggtagcgcacccgatgcggcgctgtactggtatgcgc 

gaattattaccgctggtggcgatccgttatatgtcgcgcgtcgctgtctggcgattgcgtctgaagacgtcggtaatgccgatcca 

cgggcgatgcaggtggcaattgcggcctgggattgctttactcgcgttggcccggcggaaggtgaacgcgccattgctcaggc 

gattgtttacctggcctgcgcgccaaaaagcaacgctgtctacactgcgtttaaagccgcgctggccgatgctcgcgaacgcc 

cggattatgacgtgccggttcatttgcgtaatgcgccgacgaaattaatgaaggaaatgggctacgggcaggaatatcgttac 

gctcatgatgaagcaaacgcttatgctgccggtgaggtttacttcccgccggaaatagcacaaacacgctattatttcccgaca 

aacaggggccttgaaggcaagattggcgaaaagctcgcctggctggctgaacaggatcaaaatagccccataaaacgcta 

ccgttaa (SEQ ID NO: 32) 

18) b1808 

gtgacggacgattttgcaGcagacggtcagctggcgaaagcgataccaggctttaagccgcgagaaccacagcgacagat 

ggcggtagccgtcacccaggcgatagaaaaaggccagccgctggtggtggaagcaggaaccggtacgggcaaaaccta 

cgcttacctggctcctgcgctgcgggcgaaaaagaaagtcattatctcgaccggctcaaaagcgttgcaggatcagctctaca 

gccgcgatttgccaacagtctcaaaggcattgaaatatacgggcaacgtggcgctgctgaaagggcgctcaaactacctctg 

cctcgaacgtctcgaacagcaggcgctggcggggggcgatctgccggtacaaatcttaagcgatgtgatcctgctgcgctcct 

ggtctaatcaaacagtcgatggtgatatcagcacctgcgtcagcgtggcggaagattcacaggcgtggccgctggtcaccag 

caccaacgacaactgtcttggcagcgactgcccgatgtataaagattgctttgtggtcaaagcacgtaaaaaagcgatggac 

gccgatgtggtggtggtaaaccatcatctctttctggcggatatggtggttaaagagagtggatttggcgaactgatcccggaag 

cggacgtcatgatcttcgacgaagcccaccagctaccggacattgccagccagtattttggtcagtcactctccagtcgacaac 

tgctcgacctggcaaaagacatcaccatcgcctaccgcaccgaattaaaagacacccagcagttacaaaagtgcgctgatc 

gtcttgcccagagtgcgcaggattttcgtctgcaactcggtgagccaggttatcgcggtaacctgcgtgagctgttagctaatccg 

caaattcagcgggcatttttactgctcgatgacaccctggaactttgttatgacgtggcgaaactgtcactggggcgttccgccttg 

ctggatgcggcatttgagcgcgccacgttgtatcgcacacggctgaagcggctaaaagagatcaatcagccgggctacagct 

actggtacgaatgcacttcgcgccattttactctggctctcacgccgctcagcgtggcggataaattcaaagagttaatggcgca 

aaaacccggtagctggatcttcacctcagcaacgctgtcggtgaacgacgatctgcatcatttcacctcgcggcttggcatcga 

acaggccgagtcgttgctgttgcccagcccatttgattacagccgccaggcgttactctgtgtgctgcgcaatctgccgcaaacc 

aaccagccaggttctgctcgccagttagcggcaatgctgcgaccgatcatcgaagctaacaacggtcgttgttttatgctttgtac 

ctcgcacgccatgatgcgcgatctggccgagcagttccgcgctaccatgacgcttcctgtattgttgcagggggaaaccagca 

aagggcaactgttgcagcaatttgtcagcgccggtaatgcgcttcttgtggcaaccagcagtttctgggaaggggtggacglgc. 

gtggcgatacattgtcattggtaattatcgacaaattgccgtttacctcgccggatgatccactgttaaaagcgcgcatggaagat 
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tgtcgtttgcgcggtggcgacccgttcgatgaagtgcaactaccagatgccgtcattactctcaaacagggggtagggcgact 
gattcgcgacgccgacgatcgtggcgtgctggtgatttgtgacaatcggctggtgatgcgtccttacggcgcgacgtttctcgcc 
agtctgccgcccgcgccacgcacccgtgacattgcccgtgcggttcgtttccttgcgataccatcctccaggtaa (SEQ ID 



1 9) yeaA 

atggctaataaaccttcggcagaagaactgaaaaaaaatttgtccgagatgcagttttacgtgacgcagaatcatgggacag 
aaccgccatttacgggtcgtttactgcataacaagcgtgacggcgtatatcactgtttgatctgcgatgccccgctgtttcattccc 
aaaccaagtatgattccggctgtggctggcccagtttctacgaaccggtaagtgaagaatccattcgttatatcaaagacttgtc 
acatggaatgcagcgcatagaaattcgttgcggtaactgtgatgcccatctggggcatgtcttccccgacgggccgcagccaa 
cgggcgaacgttattgtgttaactctgcctctttacgctttaccgatggcgaaaacggcgaagaaatcaacggttga (SEQ 
ID NO: 34) 

20) yagF 

atgaccattgagaaaattttcaccccgcaggacgacgcgttttatgcggtgatcacccacgcggcggggccgcagggcgctc 

tgccgctgaccccgcagatgctgatggaatctcccagcggcaacctgttcggcatgacgcagaacgccgggatgggctggg 

acgccaacaagctcaccggcaaagaggtgctgattatcggcactcagggcggcatccgcgccggagacggacgcccaat 

cgcgctgggctaccacaccgggcattgggagatcggcatgcagatgcaggcggcggcgaaggagatcacccgcaatggc 

gggatcccgttcgcggccttcgtcagcgatccgtgcgacgggcgctcgcagggcacgcacggtatgttcgattccctgccgta 

ccgcaacgacgcggcgatcgtgtttcgccgcctgatccgctccctgccgacgcggcgggcggtgatcggcgtagcgacctg 

cgataaagggctgcccgccaccatgattgcgctggccgcgatgcacgacctgccgactattctggtgccgggcggggcgac 

gctgccgccgaccgtcggggaagacgcgggcaaggtgcagaccatcggcgcgcgtttcgccaaccacgaactctccctgc 

aogaggccgccgaactgggctgtcgcgcctgcgcctcgccgggcggcgggtgtcagttcctcggcacggcgggcacctcg 

caggtggtcgcggaggcgctgggtctggcgctgccgcactccgcgctggcgccgtccgggcaggcggtgtggctggagatc 

gcccgccagtcggcgcgcgcggtcagcgagctggatagccgcggcatcaccacgcgggatatcctctccgataaagccat 

cgaaaacgcgatggtgatccacgcggcgttcggcggctccaccaatttactgctgcacattccggccatcgcccacgcggcg 

ggctgcacgatcccggacgttgagcactggacgcgcatcaaccgtaaagtgccgcgtctggtgagcgtgctgcccaacggc 

ccggactatcacccgaccgtgcgcgccttcctcgcgggcggcgtgccggaggtgatgctccacctgcgcgacctcggcctgc 

tgcatctggacgccatgaccgtgaccggccagacggtgggcgagaaccttgaatggtggcaggcgtccgagcgccgggcg 

cgcttccgccagtgcctgcgcgagcaggacggcgtagagccggatgacgtgatcctgccgccggagaaggcaaaagcga 

aagggctgacctcgacggtctgcttcccgacgggcaacatcgctccggaaggttcggtgatcaaggccacggcgatcgacc 

CQtcggtggtgggcgaagatggcgtataccaccacaccggccgggtgcgggtgtttgtctcggaagcgcaggcgatcaagg 

cgatcaagcgggaagagattgtgcagggcgatatcatggtggtgatcggcggcgggccgtccggcaccggcatggaagag 

acctaccagctcacctccgcgctaaagcatatctcgtggggcaagacggtgtcgctcatcaccgatgcgcgcttctcgggcgt 

gtcgacgggcgcctgcttcggccacgtgtcgccggaggcgctggcgggcgggccgattggcaagctgcgcgataacgaca 

tcatcgagattgccgtggatcgtctgacgttaactggcagcgtgaacttcatcggcaccgcggacaacccgctgacgccgga 

agagggcgcgcgcgagctggcgcggcggcagacgcacccggacctgcacgcccacgactttttgccggacgacacccg 

gctgtgggcggcactgcagtcggtgagcggcggcacctggaaaggctgtatttatgacaccgataaaattatcgaggtaatta 

acgccggtaaaaaagcgctcggaatttaa (SEQ ID NO: 35) 

21) b1983 

gtgggacgtaaatgggccaatattgttgctaaaaaaacggctaaagacggtgcaacgtctaaaatttatgcaaaattcggtgta 

gaaatctatgctgctgctaaacaaggtgaacccgatccagaattaaacacatctttaaaattcgttattgaacgtgcaaagcag 

gcacaagttccaaagcacgttattgataaagcaattgataaagccaaaggcggcggagatgaaacgttcgtgcagggacgt 

tatgaaggctttggtcctaatggctcaatgattatcgccgagacattgacttcaaatgttaaccgtacgattgctaacgttcg^^ 

attttcaataaaaaaggcggcaatatcggagcggcaggttctgtcagctatatgtttgacaatacgggtgtgattgtatttaaagg 

gacagaccctgaccatatttttgaaattttacttgaagctgcfagttgatgttcgtgatgtgactgaagaagaaggtaacattgttatt 

tatactgaacctactgaccttcataaaggaatcgcggctctaaaagcagctggaatcactgagttctcaacaacagaattaga 

aatgattgctcaatctgaagttgagctttccccagaagatttagaaatctttgaagggcttgttgatgcccttgaagatgacgacg 

atgtacaaaaagtttatcataacgtcgcaaatctctaa (SEQ ID NO: 36) 

22) yidD 

atggcgccgccactgtcgcctggctcgcgggtcctgatagccctcattcgggtctatcaacgcctgattagtccgctactcgggc 
cgcattgtcgtttcactccaacctgttcaagctacggaattgaggcattgcgcaggtttggagtgataaaaggcagttggttgacg 
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gtgaaacgcgtattaaaatgccaccctttacaccctggtggtgacgatcccgtcccgcccggaccatttgataccaqaqaaca 
ctaa (SEQ ID NO: 37) » a y 



23) yceG 

atgaaaaaagtgttattgataatcttgttattgctggtggtactgggtatcgccgctggtgtgggcgtctggaaggttcgccatcttg 

ccgacagcaaattgcttatcaaagaagagacgatatttaccctgaagccagggaccggacgtctggcgctcggtgaacagc 

tttatgccgataagatcatcaatcgtccacgggtttttcaatggctgctgcgtatcgaaccggatctttctcactttaaagccg 

ttaccgctttacaccgcagatgaccgtgcgcgagatgctgaaattgctggaaagcggtaaagaagcacagttccctctgcgac 

tggtagaagggatgcgtctgagcgattacctcaagcaattgcgtgaggccccgtatatcaagcatacgctgagcgatgataag 

tacgccaccgtagcgcaggcacttgaactggaaaacccggagtggattgaaggttggttctggccagacacctggatgtata 

ccgccaataccaccgatgtcgcgttactcaagcgagcgcacaagaaaatggtgaaagcggtcgatagcgcctgggaagg 

gcgtgcggacggtctgccttataaagataaaaaccagttggtgacgatggcatcaattatcgaaaaagaaaccgccgttgcc 

agtgaacgcgataaggttgcctcagtatttatcaaccgtttacgcattggtatgcgcctgcagaccgacccgaccgtgatttacg 

ggatgggagagcgttataatggcaaactttctcgtgcagacctggaaacgccgacagcgtataacacctataccattaccggt 

ctgccgccaggtgcgatagcgacgccgggggcggattcgctgaaggctgctgcgcatccggcaaaaacgccgtatctctatt 

ttgtggccgatggtaaaggtggtcacacgtttaataccaatcttgccagtcataacaagtctgtgcaggattatctgaaagtgctt 

aaggaaaaaaatgcgcagtaa 

(SEQ ID NO:38) 

24) yjbC 

atgctgcccgactcatcagtccgtttaaataaatacatcagcgaaagcggaatttgctcacgccgcgaagcggatcgctatatc 

gagcaaggcaatgtgttccttaatggcaagcgagccaccattggcgatcaggtgaaacccggcgacgttgtgaaagtaaac 

ggtcagttgattgaacctcgggaagccgaagatttggtacttatcgccctgaacaagcccgttggtattgtaagcaccaccgaa 

gatggcgagcgcgataacattgtcgatttcgttaaccacagcaaacgcgtgttcccgattggccgcctggataaagactccca 

ggggctgattttcctcaccaatcacggcgatctggtgaataagatcctgcgtgctggcaatgatcatgagaaagagtatctggtg 

acggtcgataaaccgattaccgaggagtttattcgcggcatgagtgcgggggtgccaatcctcgggacagtgaccaaaaagt 

gcaaagttaaaaaagaagcgccgtttgtcttccgcattaccctggtgcaggggctgaaccgtcagatccggcgcatgtgcga 

gcatttcggctatgaagtgaaaaagctggaacgcacgcgcatcatgaacgttagcttaagcggcattccgctgggggaatgg 

cgcgatttaaccgacgatgagttaatcgacctctUaagctcattgaaaattcctcttccgaggtaaaacctaaagcgaaggcc 

aaaccgaaaacagcgggcatcaaacgtccagtcgttaagatggaaaaaacggcggaaaaaggcggtcgcccggcgtcc 

aacggtaagcgttttacctcgccggggcgtaaaaagaaggggcgctga (SEQ IDNO:39) 
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1 

SEQUENCE LISTING 

<110> GPC Biotech AG 

<120> Novel method, for identifying antibacterial compounds 

<130> D 1400 PCT 

<140> 
<141> 

<160> 45 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 



atggttataa atttggagtg tgaaggttat tgcgtgtaa 



<210> 2 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 2 

gttataaatt tggagtgtga aggttattgc gtg 



<210> 3 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 3 

cacgcaataa ccttcacact ccaaatttat aac 



<210> 4 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: artificia 
sequence 

<400> 4 

gatcggatcc aaattccagt tagccatgat gcggtc 



<210> 5 
<211> 54 
<212> DNA 

<213> Artificial Sequence 

<220> . ^ 

<223> Description of Artificial Sequence: artificial 

sequence 

<400> 5 - ^ ^ 

cacgcaataa cctucacact ccaaatttat aaccattata cacggacgct atgc 



<210> 6 
<211> 55 
<212> DNA 

<213> Artificial Sequence 



<220> . 
<223> Description of Artificial Sequence: artificial 

sequence 

<400> 6 ^. 
gttataaatt tggagtgtga aggttattgc gtgacggatt aattttgttt ctctt 



<210> 7 
<211> 37 
<212> DNA 

<213> Artificial Sequence 

<220> . . 

<223> Description of Artificial Sequence: artificia 

sequence 
<400> 7 

gatcgtcgac gcgctcgata tcaccgatga acaaccg 



<210> 8 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> . . 

<223> Description of Artificial Sequence: artificia 

sequence 
<400> 8 

caatccgctg ctttatttct gtcag 
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<210> 9 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 9 - ' 

ttataacgaa atcaacggga aacct 



<210> 10 . 
<211> 34 
<212> DWA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificia 
sequence 

<400> 10 

gatcggatcc ctctaaaaaa taaggaatta aagg 



<210> 11 
<211> 56 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 11 

cacgcaataa ccttcacact ccaaatttat aaccatagga tacctaatta att 



<210> 12 
<211> 54 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: artificia 
sequence 



<400> 12 

gttataaatt tggagtgtga aggttattgc gtgaagagcg ccatttccca ccgt 



<210> 13 

<211> 36 

<212> DNA 

<213> Artificial 



Sequence 



<220> 
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<223> Description cf Artificial Sequence: artificial 
sequence 

<400> 13 

gatcgtcgac tcatactcct gataacccgc tgcggt 



<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 14 

gttgacggcc aggccaacag teat 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 15 

ataaccctgg gcaatcgcct eg 



<210> 16 
<211> 480 
<212> DNA 

<213> Escherichia coli 
<400> 16 

atgcgaattg gacacggttt tgacgtacat gcctttggcg gtgaaggccc aattatcatt 60 

ggtggcgtac gcattcctta cgaaaaagga ttgctggcgc attctgatgg cgacgtggcg 120 

ctccatgcgt tgaccgatgc attgcttggc gcggcggcgc tgggggatat cggcaagctg 180 

ttcccggata ccgatccggc atttaaaggt gccgatagcc gcgagctgct acgcgaagcc 240 

tggcgtcgta ttcaggcgaa gggttatacc cttggcaacg tcgatgtcac tatcatcgct 300 

caggcaccga agatgttgcc gcacattcca caaatgcgcg tgtttattgc cgaagatctc 350 

-ggctgccata tggatgatgt taacgtgaaa gccactacta cggaaaaact gggatttacc 420 

ggacgtgggg aagggattgc ctgtgaagcg gtggcgctac tcattaaggc aacaaaatga 480 



<210> 17 
<211> 537 
<212> DNA 

<213> Escherichia coli 
<400> 17 

atgcgccgcg cttttataac cggagttttc 
tactggatgc gtcacgcgct gacgctggcg 
gtcggcgcgg tattagtgca taacaatcgg 
ggtcgccatg atcccaccgc acatgcagaa 



tttttgtctg aagtcgaatt tagccacgaa 60 
aaacgtgcct gggatgagcg ggaagtgccg 120. 
gtaatcggcg aaggctggaa ccgcccgatt 180 
atcatggccc tgcggcaggg tggtctggtg 240 
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atcgtctgat cgacgccacg ttgtatgtca cgcttgaacc atgtgtaatg 300 
cgatgatcca cagtcgcatt ggtcgcgtgg tctttggtgc gcgcgacgcg 360 



atgcaaaatt 
tgrgccggag 

aaaactggcg ctgcgcgatc tttaatggat gtgctgcatc atccgggtat gaatcaccga 
gtggaaatta cggaaggaat actggcggat gagtgcgcgg cgttgctcag tgacttcttt 
cgcatgcgcc gccaggaaat taaagcgcag aaaaaagcgc aatcctcgac ggattaa 



420 
480 
537 



<210> 18 
<211> 621 
<212> DNA 

<213> Escherichia coli 



<400> 18 

atgaggtata tagttgcctt aacgggaggc 
gcgtttgctg atctcggaat taacgtcatt 
gaaccaggtg cacctgcgct acatgccatt 
gctgatggaa cattgcagcg ccgggccttg 
aaaaactggc ttaacgccct gctgcatccg 
cagcaagcta cttcccccta tgtactgtgg 
tataaaaaag cgaatcgagt gcttgtggtg 
accatgcagc gcgatgatgt aactcgcgag 
acgcgcgaag cccgccttgc cgtggcagat 
gctatcgcat cggatgttgc ccgcctgcac 
gtctcacagg aaaaaccgta a 



attggcagtg gcaagagtac cgttgccaat 60 
gatgccgata ttattgcgcg tcaggtggtt 120 
gctgatcact ttggcgctaa catgattgct 180 
cgcgagcgga tcttcgccaa cccggaagag 240 
ctgattcagc aagagacgca acaccagatc 300 
gttgtgccat tgctggtaga aaactcactg 360 
gatqtcagcc cagaaacgca acttaagcgc 420 
catgtcgaac aaatccttgc tgctcaggca 4 80 
gacgtcattg ataataacgg cgcaccggat 540 
gcacactatt tgcagcttgc gtcgcagttt 600 

621 



<210> 19 
<211> 852 
<212> DNA 

<213> Escherichia coli 
<400> 19 

atgcggacac agtggccctc tccggcaaaa cttaatctgt ttttatacat taccggtcag 60 
cgtgcggatg gttaccacac gctgcaaacg ctgtttcagt ttcttgatta cggcgacacc 120 
atcagcattg agcttcgtga cgatggggat attcgtctgt taacgcccgt tgaaggcgtg 180 
gaacatgaag ataacctgat cgttcgcgca gcgcgattgt tgatgaaaac tgcggcagac 240 
agcgggcgtc ttccgacggg aagcggtgcg aatatcagca ttgacaagcg tttgccgatg 300 
ggcggcggtc tcggcggtgg ttcatccaat gccgcgacgg tcctggtggc attaaatcat 360 
ctctggcaat gcgggctaag catggatgag ctggcggaaa tggggctgac gctgggcgca 420 
gatgttcctg tctttgttcg ggggcatgcc gcgtttgccg aaggcgttgg tgaaatacta 480 
acgccggtgg atccgccaga gaagtggtat ctggtggcgc accctggtgt aagtattccg 540 
actccggtga tttttaaaga tcctgaactc ccgcgcaata cgccaaaaag gtcaatagaa 600 
acgttgctaa aatgtgaatt cagcaatgat tgcgaggtta tcgcaagaaa acgttttcgc 660 
gaggttgatg cggtgctttc ctggctgtta gaatacgccc cgtcgcgcct gactgggaca 720 
ggggcctgtg tctttgctga atttgataca, gagtctgaag cccgccaggt gctagagcaa 780 
gccccggaat ggctcaatgg ctttgtggcg aaaggcgcta atctttcccc attgcacaga 840 
gccatgcttt aa 



<210> 20 
<211> 696 
<212> DNA 

<213> Escherichia coli 
<400> 20 

atgcgacttg ataaatttat cgcacagcaa ctcggcgtta' gccgtgctat tgccgggcgt 60 
gaaatccgcg gcaatcgtgt caccgtcgat ggcgaaatcg tccgtaatgc agcgttcaaa 120 
ctgcttcctg aacatgatgt cgcttacgat ggcaacccgc tggcgcagca acacggtcca 180 
cgttacttca tgctcaataa gcctcagggc tatgtttgct ccacggacga ccctgatcac 240 
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ccaacggtgc 
ttggatattg 
attacttctc 
gctgacgata 
acuaagcctg 
gggcgttatc 
catcgtgaac 
ccgttaactg 



tctartttct 
ataccaccgg 
cgcgccatca 
cggcagagca 
cggtgctgga 
atcaggtgaa 
gtattggcgg 
aagaagaaat 



tgatgaaccg 
tctggtgctg 
ttgcgagaag 
atttgctaaa 
agtgattacc 
acgcatgtt c 
tattacgctg 
tgccagcgtc 



gtagcgtgga 
atgactgatg 
acctatctgg 
ggcgtgcagc 
ccaacgcagg 
gccgccgtgg 
gatgctgatt 
gtctaa 



aactgcatgc 
atggtcagtg 
tgacactgga 
tgcataacga 
ttcgtctgac 
gtaaccacgt 
tagcccccgg 



ggcggggcgg 
ctcgcaccgc 
atcacctgta 
aaaagatctc 
catcagcgaa 
ggttgagctg 
tgaatatcgt 



300 

360 

420 

480 

540- 

600 

660 

696 



<210> 21 
<211> 885 
<212> DNA 

<213> Escherichia coli 
<400> 21 

atgattatgc cgcaacatga ccaattacat 
ggcgaactgg taaccgtttc ggaaaccctg 
cagcccgtta aaaacgtgct ggcagaactg 
ctgaagtttg atggtgatat caccgtacag 
gttattaacg gtaacaataa ccagcagatg 
ccagaaaatg ccgacctgaa aacgctggtc 
ccgagcgaag gcgaacgcta tcagggcgta 
tgcctggaag attactttat gcgttctgaa 
ggcgacgtag acggcaaacc ggctgcaggc 
aatgcccagc aggacgactt tgaccacctg 
gaactgctga ccttaccggc aaacgaagtg 
acggtttacg atccgcagga tgtggagttc 
gatgcgctga aaacgctgcc tgatgaagaa 
attgacatgc attgtgatta ctgcggtaac 
gaaatccgca acaacgcgtc tccggcagat 



cgctatctgt ttgaaaactt tgccgtgcgc 60 

caacagatcc ttgagaacca cgattatccg 120 

ctggttgcga ccagcctgtt aaccgctacg 180 

ctgcagggcg acggtccgat gaatctggcg 240 

cgcggtgtgg cgcgcgtgca gggcgaaatt 300 

gccaatggtt acgtggtgat caccattacc 360 

gtnggtctgg aaggtgatac cctggcggcc 420 

cagctgccga cgcgcctgtt tattcgcacc 480 

ggtatgttgt tgcaggtaat gcctgcgcaa 540 

gcgacgctaa ccgaaaccat caaaaccgaa 600 

ttgtggcgtt tgtatcacga agaagaggtg 660 

aaatgcacct gctcgcgtga acgttgcgcc 720 

gttgatagca tcctggcgga agatggcgaa 780 

cactatctgt tcaatgcgat ggatattgct 840 
ccgcaagttc attaa 885 



<210> 22 
<211> 759 
<212> DNA 

<213> Escherichia coli 
<400> 22 

gtggggagac gacgcggatt tttaactatg 
ctgaccagcc attctcacat cgcgctttgc 
ctgcgcatgg ggccggggca ggcgttgcaa 
gccgaaatta ccagcgccag caaaaaaagc 
gacgatcgcg aatctccgct gcatattcac 
atggaattta ctatccagaa atcgatcgaa 
tctgagcgct gcggcgttaa actggatagt 
cagaagattg caattgctgc ctgtgagcag 
ccagcgatgg atctggaagc ctggtgtgca 
cacccgcgcg ccagtaacag catcaatacg 
ctgattggcc cggaaggcgg tttatcggca 
tttactgata tcctgttggg acctcgcgtt 
attaccgcgc tacaagtacg atttggcgat 



cgtatccccc gcatttatca tcctgaacca 60 

gaagatgccg ccaaccatat cgggcgcgta 120 

ttgtttgacg gtagcaacca ggtctttgac 180 

gtggaagtga aggtgctgga aggccagatc 240 

ctcggtcagg tgatgtcgcg tggtgaaaaa 300 

ctcggtgtaa gcctcattac gccacttttt 360 

gaacgtctga acaagaagct tcagcagtgg 420 

tgtggtcgta accgggtgcc ggaaatccgt 4 80 

gagcaggatg aaggactgaa actgaatctt 540 

ttgccgttac cggttgaacg cgtccgcctg 600 

gatgaaattg ccatgactgc ccgctatcaa 660 

ttgcgtacag agacaactgc gctcaccgcc 720 

ttgggctaa 759 



<210> 23 
<211> 462 
<212> DNA 

<213> Escherichia coli 
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<400> 23 

atgatgaatc gagtaattcc gctccctgat gagcaggcaa cattagacct gggcgagcgg 60 
gtagcgaaag cctgcgatgg cgcaaccgta atctatctgt atggcgattt aggcgcaggt 120 
aaaaccacct ttagccgggg ctttttacag gctctgggtc atcagggtaa tgtcaaaagc 180 
cccacttata cgctggccga accctatacg ctcgacaact taatggtcta tcactttgat 240 
ttgtaccgcc ttgccgatcc cgaggagctg gagtttatgg ggatccgcga ttattttgcc 300 
aacgatgcca tctgcctggt ggagtggcca caacaaggta caggtgttct tcctgacccg 360 
gatgtcgaaa tacacattga ttatcaggca caaggccgtg aggcgcgcgt gagtgcggtt 420 
tcctctgcgg gtgaattctt gctggcgcgt ttacccggtt aa 462 



<210> 24 
<21i> 987 
<212> DMA 

<213> Escherichia coli 
<400> 24 

atgaaattac gctctgtaac ctacgcatta 
tcttctctgg cggcacaatc tttacgtttc 
catattgcgg cgaaaaaatt caatgattta 
ttaaaactgt tcccggacag cactctcggt 
ggcggcacca tcgatatgga aatgtccggc 
atgaacttgc ttgatgtccc tttcctgttc 
gacggcaaag tcggtgatga tctgaaagcc 
gcctactggg aaaacggctg gcgcgatgtc 
gccgacctga aagggctgaa aatccgcacc 
aaagtctttg gcgctaaccc gatcccgatg 
acccgcacta tcgacgctca ggaacacccg 
gaagtgcaga agttcctttc tctgacgcac 
aacaaagcga agtttgatgg cttaagtccg 
caagaagcgg gtaactatca gcgcaaactg 
ggcatgaaag aagcgggcgt ggaagtcatc 
gcactgggga atcaggttcg cgacatgttt 
ctgaaagccg tggatgaggt gcaataa 



ttcattgctg gcctggctgc attcagcaca 60 
ggttatgaaa catcacaaac cgactcgcaa 120 
ttgcaggaga gaaccaaagg cgagctgaaa 180 
aacgcgcagg cgatgatcag cggcgtacgt 240 
tcgaataact ttgccgggtt atcaccagtg 300 
cgcgataccg ctcacgcgca taaaacgctc 360 
tcacttgaag gtaaaggact gaaagtactg 420 
accaactcgc gcgcaccggt taaaaccccc 480 
aacaatagcc cgatgaatat cgccgcattc 540 
ccgtttgccg aagtctatac cgggctggaa 600 
atcaacgtcg tctggtcagc aaaatttttc 660 
cacgcctatt ccccgcttct ggtggtgatc 720 
gagttccagc aggcgctagt ttcatctgca 780 
gttgctgaag atcagcaaaa aatcatcgac 840 
accgatctcg accgcaaagc ctttagcgac 900 
gttaaagatg tgccgcaggg agctgatctg 960 

987 



<210> 25 
<211> 573 
<212> DNA 

<213> Escherichia coli 
<4aO> 25 

gtgaataata acctgcaaag agacgctatc 
cgtgtcatcg cctatccaac ggaagccgtt 
acagcagtga tgcgactgtt ggagttaaaa 
atcgcagcaa attacgagca gcttaaaccc 
cagcgtgaaa ccattttttc ccgctggcca 
gcgacaacac cgcgctggtt gacgggccgc 
catccgttgg tggttgcttt gtgccaggct 
aacttgagtg gattgccacc ttgtcgaaca 
gcgttcccgg ttgtgcctgg tgaaacgggg 
gccctgacgg gtgaactgtt tcgacagggg 



<210> 26 
<211> 459 
<212> DNA 

<213> Escherichia coli 



gcagctgcga tagatgttct caatgaagaa 60 
ttcggtgttg ggtgcgatcc tgatagcgaa 120 
cagcgtccgg ttgataaggg gctgatttta 180 
tatattgatg acaccatgtt gactgacgtg 240 
ggtcctgtca cctttgtctt tcccgcgcct 300 
tttgattcgc ttgctgtacg agtcaccgac 360 
tatggtaaac cgctggtttc taccagtgcc 420 
gtagacgaag ttcgcgcaca atttggcgcg 480 
qqgcgtttaa atccttcaga aatccgcgat 540 
taa 573 
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<400> 26 

gtgggcttgt 

ctgggttttg 

tatattgata 

agtgctgtgc 

ccgggtctcg 

gtgactctgg 

gcggtagacg 

agtaatatcc 



ccacattaga 
aactggttgg 
gtgaagatgg 
tggatgttga 
atcgcccact 
ttctccgtat 
gtgaaatgat 
agaaggcgaa 



gcaaaaatta 
catcgaattt 
catcaatgtt 
agatcccatc 
gttcacggct 
ggcggtacaa 
cacagttacc 
cctggttccc 



acagagatga 
attcgcggtc 
gatgattgtg 
accgttgctt 
gaacactacg 
aaccgtcgta 
gtcgaaggta 
cacttttaa 



ttactgcgcc 
gcacatccac 
ctgatgtgag 
ataacctgga 
cccgttttgt 
aatggcaggg 
aagatgaagt 



agttgaggcc 
actgcgcatc 
ccaccaggta 
agtctcctca 
cggagaagag 
cgttatcaaa 
gttcgcgctg 



60 

120 

180 

240- 

300 

360 

420 

459 



<210> 27 
<211> 711 
<212> DNA 

<213> Escherichia coli 



<4a0> 27 

atggcaacca 

atgcaaacgg 

tcggtgcatg 

ggcgatagcc 

ggcggtgatg 

tgggtattgg 

ttggcgttga 

atgaaacgtg 

tggcacgcgc 

gctctaaatg 

catcctcagt 

gcactggccg 



ctcatttgga 
aatgtcctaa 
cgctgctggc 
gttttgcaca- 
agcgtgccga 
tgcatgacgc 
gcgaaaccag 
ccgaaccggg 
tgacgccgca 
aaggcgcgac 
tggtcgaagg 
agttttacct 



tgtttgcgcc 
gcaatatctc 
gcatccccgg 
acttcctctg 
ttccgtgctg 
cgctcgtcct 
ccgcacgggg 
caaaaatgcc 
atttttccct 
tattaccgac 
ccgtgcggat 
cacccgaacc 



gtggttccgg 
tcaatcggta 
gtgaaacgtg 
gcgaatcatc 
gcaggtctga 
tgtttgcatc 
gggatcctcg 
attgctcata 
cgtgagctgt 
gaagcctcgg 
aacattaaag 
atccatcagg 



cggccggatt 
atcaaaccat 
tcgtcattgc 
cgcaaatcac 
aagccgctgg 
aggatgacct 
ccgcaccagt 
ccgttgatcg 
tacatgactg 
cgctggaata 
tcacgcgccc 
agaatacata 



tggccgtcga 60 
tcttgaacac 120 
cataagtcct 180 
cgttgtagat 240 
cgacgcgcag 300 
cgcgcgattg 3 60 
gcgcgatact 420 
caacggctta 480 
tctgacgcgc 540 
ttgcggattc 600 
ggaagatttg 660 
a 711 



<210> 28 
<211> 468 
<212> DNA 

<213> Escherichia coli 
<400> 28 

atgagtcagg tgatcctcga tttacaactg gcatgtgaag ataattccgg gttaccggaa 60 
gagagccagt ttcagacatg gctgaatgcg gtgatcccgc agtttcagga agaatcggaa 120 
gtgacgattc gcgtggtcga taccgccgaa agccacagtc tgaatctgac ctatcgcggt 180 
aaggataagc cgaccaacgt gctctccttc ccgtttgaag tgccgcctgg catggaaatg 240 
tcgctactgg gcgatctggt tatctgccgt caggtggttg agaaggaagc tcaggagcaa 300 
ggcaaaccac tggaggcgca ctgggcgcat atggtggtgc acggcagtct gcatttgtta 360 
ggttacgatc acatcgaaga tgacgaagca gaagaaatgg aagccctcga aacagagatt 420 
atgcttgctc tgggctatga ggatccgtac attgccgaga aagaataa 4 68 



<210> 29 
<211> 1119 
<212> DNA 

<213> Escherichia coli 
<400> 29 

atgcataacc aggctccaat tcaacgtaga aaatcaacac gtatttacgt tgggaatgtg 60 
ccgattggcg atggtgctcc catcgccgta cagtccatga ccaatacgcg tacgacagac 120 
gtcgaagcaa cggtcaatca aatcaaggcg ctggaacgcg ttggcgctga tatcgtccgt 180 
gtatccgtac cgacgatgga cgcggcagaa gcgttcaaac tcatcaaaca gcaggttaac 240 
gtgccgctgg tggctgacat ccacttcgac tatcgcattg cgctgaaagt agcggaatac 300 
ggcgtcgatt gtctgcgtat taaccctggc aatatcggta atgaagagcg tattcgcatg 360 
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gtggttgact 
ctggaaaaag 
gccatgcgtc 
aaagcgtctg 
cagccgtrgc 
gccattggtt 
gcggccgatc 
cgttcgcgag 
atcggtacgg 
tcgattatcg 
accggcggca 
aacaacgata 
gaagcgcgtc 



gtgcgcgcga 
atctgcaaga 
atgttgatca 
acgtcttcct 
atctggggat 
taggtctgct 
cggtcgaaga 
ggatcaactt 
ttaacgcgct 
gctgcgtggt 
acaagaaaag 
tgatcgacca 
gaattgacgt 



taaaaacatt 
aaagtatggc 
tctcgatcgc 
cgctcttgag 
caccgaagcc 
gctgtctgaa 
gatcaaagtc 
catcgcctgc 
ggagcaacgc 
gaatggccca 
cggcctctat 
gctggaagca 
tcagcaggtt 



ccgatccgta 
gaaccgacgc 
ctgaacttcg 
tcttatcgtt 
ggtggtgcgc 
ggcatcggcg 
ggtttcgata 
ccgacctgtt 
ctggaagata 
ggtgaggcgc 
'gaagatggcg 
cgcattcgtg 
gaaaaataa 



ttggcgttaa 
cgcaggcgtt 
atcagttcaa 
tgctggcaaa 
gcagcggggc 
acacgctgcg 
ttttgaaatc 
cgcgtcagga 
tcatcactcc 
tggtttctac 
tgcgcaaaga 
cgaaagccag 



cgccggatcg 
gctggaatct 
agtcagcgtg 
acagatcgat 
agtaaaatcc 
cgtatcgctg 
gctgcgtatc 
atttgatgtt 
gatggacgtt 
actcggcgtc 
ccgtctggac 
tcagctggac 



420 

480 

540 

600 

660"- 

720 

780 

840 

900 

960 

1020 

1080 

1119 



<210> 30 
<211> 480 
<212> DNA 

<213> Escherichia coli 
<400> 30 

atgcaaaaac gggcgattta tccgggtact ttcgatccca ttaccaatgg tcatatcgat 60 
atcgtgacgc gcgccacgca gatgttcgat cacgttattc tggcgattgc cgccagcccc 120 
agtaaaaaac cgatgtttac cctggaagag cgtgtggcac tggcacagca ggcaaccgcg 180 
catctgggga acgtggaagt ggtcgggttt agtgatttaa tggcgaactt cgcccgtaat 240 
caacacgcta cggtgctgat tcgtggcctg cgtgcggtgg cagattttga atatgaaatg 300 
cagctggcgc atatgaatcg ccacttaatg ccggaactgg aaagtgtgtt tctgatgccg 360 
tcgaaagagt ggtcgtttat ctcttcatcg ttggtgaaag aggtggcgcg ccatcagggc 420 
gatgtcaccc atttcctgcc ggagaatgtc catcaggcgc tgatggcgaa gttagcgtag 480 



<210> 31 
<211> 699 
<212> DNA 

<213> Escherichia coli 



<400> 31 

atgaaaatcg gcatcattgg tgcaatggaa 
gaaaaccgtc aaactatcag tctcggcggt 
accgaggttg cgcttctgaa atcgggcatc 
"ttgctgttgg aacactgcaa gccagatgtg 
gcaccaacgt tgaaagtggg cgatatcgtt 
gatgtcacgg catttggtta tgaatacggt 
gctgacgata aactgatcgc tgccgctgag 
gtacgtggcc tgattgttag cggcgacgct 
atccgccaca acttcccaca ggccattgct 
gtctgccaca atttcaacgt cccgtttgtt 
caacagtctc atcttagctt cgatgagttc 
atggttgagt cactggtgca gaaacttgca 



gaagaagtta cgctgctgcg tgacaaaatc 60 
tgcgaaatct ataccggcca actgaatgga 120 
ggtaaagtcg ctgcggcgct gggtgccact 180 
attattaaca ccggttctgc cggtggcctg 240 
gtctcggacg aagcacgtta tcacgacgcg 300 
cagttaccag gctgtccggc aggctttaaa 360 
gcctgcattg ccgaactgaa tcttaacgct 420 
ttcatcaacg gttctgttgg tctggcgaaa 480 
gta.gagatgg aagcgacggc aatcgcccat 540 
gtcgtacgcg ccatctccga cgtggccgat 600 
ctggctgttg ccgctaaaca gtccagcctg 660 
catggctaa 699 



<210> 32 
<211> 1344 
<212> DNA 

<213> Escherichia coli 
<400> 32 

gtgagcaatc tgtcgctcga tttttcggat aatacttttc aacctctggc cgcgcgtatg 60 
cggccagaaa atttagcaca gtatatcggc cagcaacatt tgctggctgc ggggaagccg 120 
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ttgccgcgcg c~atcgaagc cgggcattta 
accggcaaaa caactctcgc tgaagtgatt 
atttctgccg tcacctctgg cgtgaaagag 
aaccgcaatg caggtcgccg cactattctt 
agccagcagg atgcatttct gcca'catatt 
accactgaaa acccgtcgtt tgagcttaat 
ctgttgaaat ccctgagtac agaggatatt 
aaaacccgtg gctatggtgg tcaggatatt 
gctgaactgg tgaatggcga cgcgcgccgg 
atggccgaag tcgatgatag cggtaagcgg 
gccggtgaac gtagcgcccg ctttgataac 
gcactgcata agtcggtacg tggtagcgca 
attattaccg ctggtggcga tccgttatat 
gaagacgtcg gtaatgccga tccacgggcg 
tttactcgcg ttggcccggc ggaaggtgaa 
gcctgcgcgc caaaaagcaa cgctgtctac 
cgcgaacgcc cggattatga cgtgccggtt 
aaggaaatgg gctacgggca ggaatatcgt 
gccggtgagg tttacttccc gccggaaata 
aggggccttg aaggcaagat tggcgaaaag 
agccccataa aacgctaccg ttaa 



10 

cattctatca tcctctgggg gccgccgggt 180 
gcccgctatg cgaacgctga rgtggaacgt 240 
attcgcgagg cgatcgagcg cgcccggcaa 300 
tttgttgacg aagttcaccg tttcaacaaa 360 
gaagacggca ccatcacttt tattggcgca 420 
tcggcactgc tttcccgtgc ccgtgtctat 480 
gagcaagtac taactcaggc gatggaagac 540 
gttctgccag atgaaacacg acgcgccatt 600 
gcgttaaata cgctggaaat gatggcggat 660 
gtcctgaagc ctgaattact gaccgaaatc 720 
aaaggcgatc gcttttacga tctgatttcc 780 
cccgatgcgg cgctgtactg gtatgcgcga 840 
gtcgcgcgtc gctgtctggc gattgcgtct 900 
atgcaggtgg caattgcggc ctgggattgc 960 
cgcgccattg ctcaggcgat tgtttacctg 1020 
actgcgttta aagccgcgct ggccgatgct 1080 
catttgcgta atgcgccgac gaaattaatg 1140 
tacgctcatg atgaagcaaa cgcttatgct 1200 
gcacaaacac gctattattt cccgacaaac 1260 
ctcgcctggc tggctgaaca ggatcaaaat 1320 

1344 



<210> 33 
<211> 1911 
<212> DNA 

<213> Escherichia coli 
<400> 33 

gtgacggacg attttgcacc agacggtcag 
cgagaaccac agcgacagat ggcggtagcc 
ctggtggtgg aagcaggaac cggtacgggc 
cgggcgaaaa agaaagtcat tatctcgacc 
agccgcgatt tgccaacagt ctcaaaggca 
aaagggcgct caaactacct ctgcctcgaa 
gatctgccgg tacaaatctt aagcgatgtg 
gtcgatggtg atatcagcac ctgcgtcagc 
gtcaccagca ccaacgacaa ctgtcttggc 
gtggtcaaag cacgtaaaaa agcgatggac 
tttctggcgg atatggtggt taaagagagt 
gtcatgatct tcgacgaagc ccaccagcta 
tcactctcca gtcgacaact gctcgacctg 
gaattaaaag acacccagca gttacaaaag 
gattttcgtc tgcaactcgg tgagccaggt 
aatccgcaaa ttcagcgggc atttttactg 
gtggcgaaac tgtcactggg gcgttccgcc 
ttgtatcgca cacggctgaa gcggctaaaa 
tacgaatgca cttcgcgcca ttttactctg 
ttcaaagagt taatggcgca aaaacccggt 
gtgaacgacg atctgcatca tttcacctcg 
ctgttgccca gcccatttga tt'acagccgc 
ccgcaaacca accagccagg ttctgctcgc 
gaagctaaca acggtcgttg ttttatgctt 
gccgagcagt tccgcgctac catgacgctt 
gggcaactgt t.gcagcaatt tgtcagcgcc 
ttctgggaag gggtggacgt gcgtggcgat 
ccgtttacct cgccggatga tccactgtta 
ggtggcgacc cgttcgatga agtgcaacta 
gtagggcgac tgattcgcga cgccgacgat 



ctggcgaaag cgataccagg ctttaagccg 60 

gtcacccagg cgatagaaaa aggccagccg 120 

aaaacctacg cttacctggc tcctgcgctg 180 

ggctcaaaag cgttgcagga tcagctctac 240 

ttgaaatata cgggcaacgt ggcgctgctg 300 

cgtctcgaac agcaggcgct ggcggggggc 360 

atcctgctgc gctcctggtc taatcaaaca 420 

gtggcggaag attcacaggc gtggccgctg 480 

agcgactgcc cgatgtataa agattgcttt 540 

gccgatgtgg tggtggtaaa ccatcatctc 600 

ggatttggcg aactgatccc ggaagcggac 660 

ccggacattg ccagccagta ttttggtcag 720 

gcaaaagaca tcaccatcgc ctaccgcacc 780 

tgcgctgatc gtcttgccca gagtgcgcag 840 

tatcgcggta acctgcgtga gctgttagct 900 

ctcgatgaca ccctggaact ttgttatgac 960 

ttgctggatg cggcatttga gcgcgccacg 1020 

gagatcaatc agccgggcta cagctactgg 1080 

gctctcacgc cgctcagcgt ggcggataaa 1140 

agctggatct tcacctcagc aacgctgtcg 1200 

cggcttggca tcgaacaggc cgagtcgttg 1260 

caggcgttac tctgtgtgct gcgcaatctg 1320 

cagttagcgg caatgctgcg accgatcatc 1380 

tgtacctcgc acgccatgat gcgcgatctg 144a 

cctgtattgt tgcaggggga aaccagcaaa 1500 

ggtaatgcgc ttcttgtggc aaccagcagt 1560 

acattgtcat tggtaattat cgacaaattg 1620 

aaagcgcgca tggaagattg tcgtttgcgc 1680 
ccagatgccg tcattactct caaacagggg 1740 
cgtggcgtgc tggtgatttg tgacaatcgg 1800 
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ctggtgatgc gtccttacgg cgcgacgttt ctcgccagtc tgccgcccgc gccacgcacc 1860 

cgtgacattg cccgtgcggt tcgtttcctt gcgataccat cctccaggta a 1911 

<210> 34 
<211> 414 
<212> DNA 

<213> Escherichia coli 
<400> 34 

atggctaata aaccttcggc agaagaactg aaaaaaaatt tgtccgagat gcagttttac 60 

gtgacgcaga atcatgggac agaaccgcca tttacgggtc gtttactgca taacaagcgt 120 

gacggcgtat atcactgttt gatctgcgat gccccgctgt ttcattccca aaccaagtat 180 

gattccggct gtggctggcc cagtttctac gaaccggtaa gtgaagaatc cattcgttat 240 

atcaaagact tgtcacatgg aatgcagcgc atagaaattc gttgcggtaa ctgtgatgcc 300 

catctggggc atgtcttccc cgacgggccg cagccaacgg gcgaacgtta ttgtgttaac 360 

tctgcctctt tacgctttac cgatggcgaa aacggcgaag aaatcaacgg ttga 414 



<210> 35 
<211> 1968 
<212> DNA 

<213> Escherichia coli 
<400> 35 

atgaccattg agaaaatttt caccccgcag gacgacgcgt tttatgcggt gatcacccac 60 
gcggcggggc cgcagggcgc . tctgccgctg accccgcaga tgctgatgga atctcccagc 120 
ggcaacctgt tcggcatgac ' gcagaacgcc gggatgggct gggacgccaa caagctcacc 180 
ggcaaagagg tgctgattat cggcactcag ggcggcatcc gcgccggaga cggacgccca 240 
atcgcgctgg gctaccacac cgggcattgg gagatcggca tgcagatgca ggcggcggcg 300 
aaggagatca cccgcaatgg cgggatcccg ttcgcggcct tcgtcagcga tccgtgcgac 360 
gggcgctcgc agggcacgca cggtatgttc gattccctgc cgtaccgcaa cgacgcggcg 420 
atcgtgtttc gccgcctgat ccgctccctg ccgacgcggc gggcggtgat cggcgtagcg 480 
acctgcgata aagggctgcc cgccaccatg attgcgctgg ccgcgatgca cgacctgccg 540 
actattctgg tgccgggcgg ggcgacgctg ccgccgaccg tcggggaaga cgcgggcaag 600 
gtgcagacca tcggcgcgcg tttcgccaac cacgaactct ccctgcagga ggccgccgaa 660 
ctgggctgtc gcgcctgcgc ctcgccgggc ggcgggtgtc agttcctcgg cacggcgggc 720 
acctcgcagg tggtcgcgga ggcgctgggt ctggcgctgc cgcactccgc gctggcgccg 780 
tccgggcagg cggtgtggct ggagatcgcc cgccagtcgg cgcgcgcggt cagcgagctg 840 
gatagccgcg gcatcaccac gcgggatatc ctctccgata aagccatcga aaacgcgatg 900 
gtgatccacg cggcgttcgg cggctccacc aatttactgc tgcacattcc ggccatcgcc 960 
cacgcggcgg gctgcacgat cccggacgtt gagcactgga cgcgcatcaa ccgtaaagtg 1020 
ccgcgtctgg tgagcgtgct gcccaacggc ccggactatc acccgaccgt gcgcgccttc 1080 
ctcgcgggcg gcgtgccgga ggtgatgctc cacctgcgcg acctcggcct gctgcatctg 1140 
gacgccatga ccgtgaccgg ccagacggtg ggcgagaacc ttgaatggtg gcaggcgtcc 1200 
gagcgccggg cgcgcttccg ccagtgcctg cgcgagcagg acggcgtaga gccggatgac 1260 
gtgatcctgc cgccggagaa ggcaaaagcg aaagggctga cctcgacggt ctgcttcccg 1320 
acgggcaaca tcgctccgga aggttcggtg atcaaggcca cggcgatcga cccgtcggtg 1380 
gtgggcgaag atggcgtata ccaccacacc ggccgggtgc gggtgtttgt ctcggaagcg 1440 
caggcgatca aggcgatcaa gcgggaagag attgtgcagg gcgatatcat ggtggtgatc 1500 
ggcggcgggc cgtccggcac cggcatggaa gagacctacc agctcacctc cgcgctaaag 1560 
catatctcgt ggggcaagac ggtgtcgctc atcaccgatg cgcgcttctc gggcgtgtcg 1620 
acgggcgcct gcttcggcca cgtgtcgccg gaggcgctgg cgggcgggcc gattggcaag 1680 
ctgcgcgata acgacatcat cgagattgcc gtgga.tcgtc tgacgttaac tggcagcgtg 1740 
aacttcatcg gcaccgcgga caacccgctg acgccggaag agggcgcgcg cgagctggcg 1800 
cggcggcaga cgcacccgga cctgcacgcc cacgactttt tgccggacga cacccggctg 1860 
tgggcggcac tgcagtcggt gagcggcggc acctggaaag gctgtattta tgacaccgat 1920 
aaaattatcg aggtaattaa cgccggtaaa aaagcgctcg gaatttaa 1968 
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<21C> 36 
<211> 717 
<212> DNA 

<213> Escherichia coli 
<400> 36 

gtgggacgta aatgggccaa tattgttgct aaaaaaacgg ctaaagacgg tgcaacgtct 60 

aaaatttatg caaaattcgg tgtagaaatc tatgctgctg ctaaacaagg tgaacccgat 120 

ccagaattaa acacatcttt aaaattcgtt attgaacgtg caaagcaggc acaagttcca 180 

aagcacgtta ttgataaagc aattgataaa gccaaaggcg gcggagatga aacgttcgtg 240 

cagggacgtt atgaaggctt tggtcctaat ggctcaatga ttatcgccga gacattgact 300 

tcaaatgtta accgtacgat tgctaacgtt cgcacaattt " tcaataaaaa aggcggcaat 360 

atcggagcgg caggttctgt cagctatatg tttgacaata cgggtgtgat tgtatttaaa 420 

gggacagacc ctgaccatat ttttgaaatt ttacttgaag ctgaagttga tgttcgtgat 4 80 

gtgactgaag aagaaggtaa cattgttatt tatactgaac ctactgacct tcataaagga 540 

atcgcggctc taaaagcagc tggaatcact gagttctcaa caacagaatt agaaatgatt 600 

gctcaatctg aagttgagct ttccccagaa gatttagaaa tctttgaagg gcttgttgat 660 

gcccttgaag atgacgacga tgtacaaaaa gtttatcata acgtcgcaaa tetctaa 717 



<210> 37 
<211> 258 
<212> DNA 

<213> Escherichia coli 



<400> 37 

atggcgccgc cactgtcgcc tggctcgcgg gtcctgatag ccctcattcg ggtctatcaa 60 

cgcctgatta gtccgctact cgggccgcat tgtcgtttca ctccaacctg ttcaagctac 120 

ggaattgagg cattgcgcag gtttggagtg ataaaaggca gttggttgac ggtgaaacgc 180 

gtattaaaat gccacccttt acaccctggt ggtgacgatc ccgtcccgcc cggaccattt 240 

gataccagag aacactaa 258 



<210> 38 , . 

<211> 1023 
<212> DNA 

<213> Escherichia coli 
<400> 38 

atgaaaaaag tgttattgat aatcttgtta ttgctggtgg tactgggtat cgccgctggt 60 
gtgggcgtct ggaaggttcg ccatcttgcc gacagcaaat tgcttatcaa agaagagacg 120 
atatttaccc tgaagccagg gaccggacgt ctggcgctcg gtgaacagct ttatgccgat 180 
aagatcatca atcgtccacg ggtttttcaa tggctgctgc gtatcgaacc ggatctttct 240 
cactttaaag ccgggactta ccgctttaca ccgcagatga ccgtgcgcga gatgctgaaa 300 
ttgctggaaa gcggtaaaga agcacagttc cctctgcgac tggtagaagg gatgcgtctg 360 
agcgattacc tcaagcaatt gcgtgaggcc ccgtatatca agcatacgct gagcgatgat 420 
aagtacgcca ccgtagcgca ggcacttgaa ctggaaaacc cggagtggat tgaaggttgg 4 80 
ttctggccag acacctggat gtataccgcc aataccaccg atgtcgcgtt actcaagcga 540 
gcgcacaaga aaatggtgaa agcggtcgat agcgcctggg aagggcgtgc ggacggtctg 600 
ccttataaag ataaaaacca gttggtgacg atggcatcaa ttatcgaaaa agaaaccgcc 660 
gttgccagtg aacgcgataa ggttgcctca gtatttatca accgtttacg cattggtatg 720 
cgcctgcaga ccgacccgac cgtgatttac gggatgggag agcgttataa tggcaaactt 780 
tctcgtgcag acctggaaac gccgacagcg tataacacct ataccattac cggtctgccg 840 
ccaggtgcga tagcgacgcc gggggcggat tcgctgaagg ctgctgcgca tccggcaaaa 900 
acgccgtatc tctattttgt ggccgatggt aaaggtggtc acacgtttaa taccaatctt 960 
gccagtcata acaagtctgt gcaggattat ctgaaagtgc ttaaggaaaa aaatgcgcag 1020 
taa 1023 
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<210> 39 
<211> 873 
<212> DNA 

<213> Escherichia coii 



<400> 39 

atgctgcccg 

cgccgcgaag 

accattggcg 

cctcgggaag 

accaccgaag 

ttcccgattg 

gatctggtga 

gtcgataaac 

gggacagtga 

ctggtgcagg 

aaaaagctgg 

tggcgcgatt 

tccgaggtaa 

gttaagatgg 

acctcgccgg 



actcatcagt 
cggatcgcta 
atcaggtgaa 
ccgaagattt 
atggcgagcg 
gccgcctgga 
ataagatcct 
cgattaccga 
ecaaaaagtg 
ggctgaaccg 
aacgcacgcg 
taaccgacga 
aacctaaagc 
aaaaaacggc 
ggcgtaaaaa 



ccgtttaaat 
tatcgagcaa 
acccggcgac 
ggtacttatc 
cgataacatt 
taaagactcc 
gcgtgctggc 
ggagtttatt 
caaagttaaa 
tcagatccgg 
catcatgaac 
tgagttaatc 
gaaggccaaa 
ggaaaaaggc 
gaaggggcgc 



aaatacatca 
ggcaatgtgt 
gttgtgaaag 
gccctgaaca 
gtcgatttcg 
caggggctga 
aatgatcatg 
cgcggcatga 
aaagaagcgc 
cgcatgtgcg 
gttagcttaa 
gacctcttta 
ccgaaaacag 
ggtcgcccgg 
tga 



gcgaaagcgg 
tccttaatgg 
taaacggtca 
agcccgttgg 
ttaaccacag 
ttttcctcac 
agaaagagta 
gtgcgggggt 
cgtttgtctt 
agcatttcgg 
gcggcattcc 
agctcattga 
cgggcatcaa 
cgtccaacgg 



aatttgctca 60 
caagcgagcc 120 
gttgattgaa 180 
tattgtaagc 240 
caaacgcgtg 300 
caatcacggc 360 
tctggtgacg 420 
gccaatcctc 480 
ccgcattacc 540 
ctatgaagtg 600 
gctgggggaa 660 
aaattcctct 720 
acgtccagtc 780 
taagcgtttt 840 
873 



<210> 40' 
<211> 159 
<212> PRT 

<213> Escherichia coli 
<400> 40 

Met Arg lie Gly His Gly Phe Asp Val His Ala Phe Gly Gly Glu Gly 
15 10 15 

Pro lie lie He Gly Gly Val Arg He Pro Tyr Glu Lys Gly Leu Leu 
20 25 30 

Ala His Ser Asp Gly Asp Val Ala Leu His Ala ..Leu Thr Asp Ala Leu 
35 40 45 

Leu Gly Ala Ala Ala Leu Gly Asp He Gly Lys Leu Phe Pro Asp Thr 
50 55 60 

Asp Pro Ala Phe Lys Gly Ala Asp Ser Arg Glu Leu Leu Arg Glu Ala 
65 70 75 80 

Trp Arg Arg He Gin Ala Lys Gly Tyr Thr Leu Gly Asn Val Asp Val 
85 '90 95 

Thr He He Ala Gin Ala Pro Lys Met Leu Pro His He Pro Gin Met 
100 105 110 

Arg Val Phe He Ala Glu Asp Leu Gly Cys His Met Asp Asp Val Asn 
. 115 120 125 

Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Gly Glu 
130 135 140 



Gly He Ala Cys Glu Ala Val Ala Leu Leu He Lys Ala Thr Lys 
145 150 155 
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<210> 41 
<211> 158 
<212> PRT 

<213> Haemophilus influenzae 
<400> 41 

Met lie Axg lie Gly His Gly Phe Asp Val His Ala Phe Gly Glu Asp 
15 10 15 

Arg Pro. Leu He He Gly Gly Val Glu Val Pro Tyr His Thr Gly Phe 
20 25 30 

lie Ala His Ser Asp Gly Asp Val Ala Leu His Ala Leu Thr Asp Ala 
35 - 40 45 

He Leu Gly Ala Ala Ala Leu Gly Asp He Gly Lys Leu Phe Pro Asp 
50 55 60 

Thr Asp Met Gin Tyr Lys Asn Ala Asp Ser Arg Gly Leu Leu Arg Glu 
65 70 75 80 

Ala Phe Arg Gin Val Gin Glu Lys Gly Tyr Lys He Gly Asn Val Asp 
. 85 90 95 

He Thr He He Ala Gin Ala Pro Lys Met Arg Pro His He Asp Ala 
100 105 110 

Met Arg Ala Lys He Ala Glu Asp Leu .Glh Cys Asp He Glu Gin Val. 
115 120 125 

Asn Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Gin 
130 135 140 

Glu Gly He Ala Cys Glu Ala Val Ala Leu Leu He Arg Gin 
145 150 155 



<210> 42 
<211> 158 
<212> PRT 

<213> Bacillus subtilis 
<400> 42 

Met Phe Arg He Gly Gin Gly Phe Asp Val His Gin Leu Val Glu Gly 
1 5 10 15 

Arg Pro Leu He He Gly Gly He Glu He Pro Tyr Glu Lys Gly Leu 
20 - . 25 30 

Leu Gly His Ser Asp Ala Asp Val Leu Leu His Thr Val Ala Asp Ala 
35 40 45 

Cys Leu Gly Ala Val Gly Glu Gly Asp He Gly Lys His Phe Pro Asp 
50 55 60 

Thr Asp Pro Glu Phe Lys Asp Ala Asp Ser Phe Lys Leu Leu Gin His 
65 70 75 80 
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Val Trp Gly He Val Lys Gin Lys Gly Tyr Vai Leu Gly Asn He- Asp 
85 90 95 

Cys Thr He He Ala Gin Lys Pro Lys Met Leu Pro Tyr He Glu Asp 
100 105 110 

Met Arg Lys Arg He Ala Glu Gly Leu Glu Ala Asp Val Ser Gin Val 
115 120 125 

Asn Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Ala 
130 135 140 

Glu Gly He Ala A.ia Gin Ala Thr Val Leu He Gin Lys Gly 
145 150 155 



<210> 43 
<211> 161 
<212> PRT 

<213> Synechocystis sp. 
<400> 43 

Met Thr Ala Leu Arg He Gly Asn Gly Tyr Asp He His Arg Leu Val 
15 10 15 

Gly Asp Arg Pro Leu He Leu Gly Gly Val Thr He Ala His His Leu 
20 25 30 

Gly Leu Asp Gly His Ser Asp Ala Asp Val Leu Thr His Ala Leu Met 
35 40 45 

Asp Ala Leu Leu Gly Ala Leu Ser Leu Gly Asp He Gly His Tyr Phe 
50 55 60 

Pro Pro Ser Asp Ala Arg Trp Gin Gly Ala Asp Ser Leu Lys Leu Leu 
65 70 75 80 

Ala Gin Val His Gin Leu He Leu Glu Arg Gly Trp Arg He Asn Asn 
85 90 95 

Leu Asp Asn Val He Val Ala Glu Gin Pro Lys Leu Lys Pro His He 
100 105 110 

Gin Ala Met Lys Glu Asn Leu Ala Lys Val Leu Thr He Asp Pro Asp 
115 120 125 

Leu He Gly He Lys Ala Thr Thr Asn Glu Arg Leu Gly Pro Thr Gly 
130 135 140 

Arg Glu Glu Gly He Ala Ala Tyr Ser Val Ala Leu Leu He Lys Glu 
145 150 155 160 

Gly 



<210> 44 
<211> 399 
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<212> PRT 

<213> Treponenia pallidum 
<400> 44 

Met Arg Arg Gly Gly Ala Cys Val Gin Lys Lys Glu Tyr Leu Pro Leu 
1 5 ■ 10 15 

Thr Ser Arg Gin Pro Gly Val Cys Leu Leu Ser Glu He Leu Val Arg 
20 25 30 

Ala Leu Glu' Ala Arg Ser Phe Phe Leu Val Val Val Thr Val Pro Ala 
35 40 45 

Gly Glu Val Ala Tyr Ala Glu Ser Gin Val Ala Cys Asp Ser Arg Leu 
50 55 60 

Ser Ala Phe Pro Ser Arg Thr Arg Pro Val He Leu Tyr Val Pro Gly 
65 70 75 80 

Ala His Thr Arg Ser Ala Ser Val Arg Ala Gly Leu Asp Ala Met Ala 
85 90 95 

Thr His Ala Pro Asp Val Val Leu Val His Asp Gly Ala Arg Pro Phe 
100 105 110 

Val Ser Val Ala Leu lie His Ser Val Leu Glu Ala Thr Cys Arg Tyr 
115 120 125 

Gly Ala Ala Val Pro Val lie Glu Ala Thr Asp Thr Pro Lys Gly Val 
130 135 140 

Ala Ala Asp Gly Ser He Glu Thr His Leu He Arg Ser Arg Val Arg 
145 , 150 155 160 

Leu Ala Gin Thr Pro Gin Gly Phe Cys Tyr Ala Ser Leu Cys Ala Ala 
165 170 175 

His His Arg Ala Ala Thr Asp Gly Glu Gin Tyr Thr Asp Asp Ser Glu 
180 185 190 

Leu Tyr Ala Arg Tyr Gly Gly Thr Val His Val Cys Ala Gly Glu Arg 
195 200 205 

Ser Asn Val Lys He Thr Tyr Pro Glu Asp Leu Glu Gin Arg Ala Ser 
210 215 220 

Glu Pro Ala Leu Thr Arg Gly He Ser Val Leu Pro Cys Thr Glu Glu 
225 230 235 240 

Gly Ala Leu Arg Val Gly Leu Gly Thr Asp Met His Ala Leu Cys Ala 
245 250 255 

Gly Arg Pro Leu He Leu Ala Gly He His He Pro Ser Lys Lys Gly 
260 265 270 

Ala Gin Gly His Ser Asp Ala Asp Val Leu Ala His Ala Ser He Asp 
275 280 285 



Ala Leu Leu Gly Ala Ala Gly Leu Gly Asp He Gly Thr Phe Phe Pro 
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290 295 

Ser Cys Asd Gly Arg Trp Lys Asp Ala 
305 310 

His Thr Trp Gin Leu Val Arg Ala Ala 
325 

Asp Ala Val Val Cys Leu Glu Gin Pro 
340 345 

Ala Met Arg Ala Ser Leu Ala Gin Ala 
355 360 

Val Phe Val Lys Ala Lys Thr Ala Glu 
370 375 

Gly Ala Ala. Val Thr Ala Gin Val Val 
385 390 
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300 

His Ser Cys Ala Leu Leu Arg 
315 320 

Cys Tro Arg Leu Val Asn Leu 
330 ' 335 

Ala Leu His Pro Phe Arg Glu 
350 

Leu Asp Thr His Val Thr Arg 
365 

Arg Leu Gly Pro Val Gly Ser 
380 

Val Leu Leu Lys Lys lie 
395 



<210> 45 
<211> 406 
<212> PRT 

<213> Helicobacter pylori 

<400> 45 , r^n 

Met Ser Leu He Arg Val Asn Gly Glu Ala Phe Lys Leu Ser Leu Glu 

1 5 10 

Ser Leu Glu Glu Asp Pro Phe Glu Thr Lys Glu Thr Leu Glu Thr Leu 
20 25 30 

He Lys Gin Thr Ser Val Val Leu Leu Ala Ala Gly Glu Ser Arg Arg 
35 40 45 

Phe Ser Gin Thr He Lys Lys Gin Trp Leu Arg Ser Asn His Thr Pro 
50 55 60 

Leu Trp Leu Ser Val Tyr Glu Ser Phe Lys Glu Ala Leu Asp Phe Lys 
65 70 75 80 

Glu He He Leu Val Val Ser Glu Leu Asp Tyr He Tyr He Lys Arg 
85 90 95 

His Tyr Pro Glu He Lys Leu Val Lys Gly Gly Ala Ser Arg Gin Glu 
100 105 HO 

Ser Val Arg Asn Ala Leu Lys He He Asp Ser Ala Tyr Thr Leu Thr 
115 120 125 

Ser Asp Val Ala Arg Gly Leu Ala Asn He Glu Ala Leu Lys Asn Leu 
130 135 140 

Phe Leu Thr Leu Gin Gin Thr Ser His Tyr Cys He Ala Pro Tyr Leu 
145 150 155 160 

Pro Cys Tyr Asp Thr Ala He Tyr Tyr Asn Glu Ala Leu Asp Arg Glu 
165 170 175 
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